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Alzheimer's Disease Secretase, APP Substrates Therefor, and Uses Therefor 

The present application is a continuation of United States Application 
Serial No. 09/416,901, filed October 13, 1999 which claims priority benefit of United 
5 Stales Provisional Patent Application No. 60/1 55,493, filed September 23, 1 999. The 
present application also claims priority benefit as a continuation*in-par1 of United 
States Patent Application Serial No. 09/404,133 and PCT/US99/20881, both filed 
September 23, 1999, both of which in turn claim priority benefit of United States 
Provisional Patent Application No. 60/101,594, filed September 24, 1998. All of 
1 0 these priority applications are hereby incorporated by reference in their entirety. 

FIELD OF THE INVENTION 

The present invention relates to Alzheimer's Disease, amyloid protein 
precursor, amyloid beta peptide, and human aspartyl proteases, as well as a method for 
1 5 the identification of agents that modulate the activity of these polypeptides and 
thereby are candidates to modulate the progression of Alzheimer's disease. 

BACKGROUND OF THE INVENTION 
Alzheimer's disease (AD) causes progressive dementia with consequent 

20 formation of amyloid plaques, neurofibrillary tangles, ghosis and neuronal loss. The 
disease occurs in both genetic and sporadic forms whose clinical course and 
pathological features are quite similar. Three genes have been discovered to dale 
which, when mutated, cause an autosomal dominant form of Alzheimer's disease. 
These encode the amyloid protein precursor (APP) and two related proteins, 

25 presenilin-I (PSl) and presenilin-2 (PS2), which, as their names suggest, are 

structurally and functionally related. Mutations in any of the three proteins have been 
observed to enhance proteolytic processing of APP via an intracellular pathway that 
produces amyloid beta peptide (AP peptide ,or sometimes here as Abeta), a 40-42 
amino acid long peptide that is the primary component of amyloid plaque in AD. 
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Dysregulation of intracellular pathways for proteolytic processing may be 
central to the pathophysiology of AD. In the case of plaque formation, mutations in 
APP, PSl or PS2 consistently alter the proteolytic processing of APP so as to enhance 
formation of Ap 1-42, a form of the Ap peptide which seems to be particularly 
5 amyloidogenic, and thus very important in AD. Different forms of APP range in size 
from 695-770 amino acids, localize to the cell surface, and have a single C-terminal 
transmembrane domain. Examples of specific isotypes of APP which are currently 
known to exist in humans are the 695-amino acid polypeptide described by Kang et. 
al (1987), Nature 325: 733-736 which is designated as the "normal" APP; the 751 

1 0 amino acid polypeptide described by Ponte et at, ( 1 988), Nature 33 J : 525-527 ( 1 988) 
and Tanzi et ai {\9%%), Nature 331: 528-530; and the 770 amino acid polypeptide 
described by Kitaguchi et, al. (1988), Nature 331: 530-532. The Abeta peptide is 
derived from a region of APP adjacent to and containing a portion of the 
transmembrane domain. Normally, processing of APP at the a-secretase site cleaves 

1 5 the midregion of the Ap sequence adjacent to the membrane and releases the soluble, 
extracellular domain of APP from the cell surface. This a-secretase APP processing 
creates soluble APP- a, which is normal and not thought to contribute to AD. 
Pathological processing of APP at the p- and y-secretase sites, which are located N- 
terminal and C-terminal to the a-secretase site, respectively, produces a very different 

20 result than processing at the a site. Sequential processing at the p- and y-secretase 
sites releases the AP peptide, a peptide possibly very important in AD pathogenesis. 
Processing at the P- and y-secretase sites can occur in both the endoplasmic reticulum 
(in neurons) and in the endosomal/lysosomal pathway after reintemalization of cell 
surface APP (in all cells). Despite intense efforts, for 10 years or more, to identify the 

25 enzymes responsible for processing APP at the P and y sites, to produce the AP 
peptide, those proteases remained unknown until this disclosure. 

SUMMARY OF THE INVENTION 

Here, for the first time, we report the identification and characterization of the 
30 p secretase enzyme, termed Aspartyl Protease 2 (Asp2). We disclose some known 
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and some novel human aspartic proteases that can act as P-secretase proteases and, for 
the first time, we explain the role these proteases have in AD. We describe regions in 
the proteases critical for their unique function and for the first time characterize their 
substrate. This is the first description of expressed isolated purified active protein of 
5 this type, assays that use the protein, in addition to the identification and creation of 
useful cell lines and inhibitors. 

Here we disclose a number of variants of the Asp2 gene and peptide. 
In one aspect, the invention provides any isolated or purified nucleic acid, 
polynucleotide that codes for a protease capable of cleaving the beta (P) secretase 

10 cleavage site of APP that contains two or more sets of special nucleic acids, where 
the special nucleic acids are sq)arated by nucleic acids that code for about 100 to 300 
amino acid positions, where the amino acids in those positions may be any amino 
acids, where the first set of special nucleic acids consists of the nucleic acids that code 
for the peptide DTG, where the first nucleic acid of the first special set of nucleic 

IS acids is the first special nucleic acid, and where the second set of nucleic acids code 
for either the peptide DSG or DTG, where the last nucleic acid of the second set of 
nucleic acids is the last special nucleic acid, with the proviso that the nucleic acids 
disclosed in SEQ ED NO. 1 and SEQ ID NO. 3 are not included. In a preferred 
embodiment, the two sets of special nucleic acids are separated by nucleic acids that 

20 code for about 1 25 to 222 amino acid positions, which may be any amino acids. In a 
highly preferred embodiment, the two sets of special nucleic acids are separated by 
nucleic acids that code for about 150 to 196, or 150-190, or 150 to 172 amino acid 
positions, which may be any amino acids. In a particular preferred embodiment, the 
two sets are separated by nucleic acids that code for about 1 72 amino acid positions, 

25 which may be any amino acids. An exemplary nucleic acid polynucleotide comprises 
the acid nucleotide sequence in SEQ ID NO. 5. In another particular preferred 
embodiment, the two sets are separated by nucleic acids that code for about 196 amino 
acids. An exemplary polynucleotide comprises the nucleotide sequence in SEQ ID 
NO. 5. In another particular embodiment, the two sets of nucleotides are separated by 

30 nucleic acids that code for about 1 90 amino acids. An exemplary polynucleotide 
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comprises the nucleotide sequence in SEQ ID NO. 1. Preferably, the first nucleic acid 
of the first special set of amino acids, that is, the first special nucleic acid, is operably 
linked to any codon where the nucleic acids of that codon codes for any peptide 
comprising from 1 to 10,000 amino acid (positions). In one variation, the first special 
5 nucleic acid is operably linked to nucleic acid polymers that code for any peptide 
selected from the group consisting of: any reporter proteins or proteins which 
facilitate purification. For example, the first special nucleic acid is operably linked to 
nucleic acid polymers that code for any peptide selected from the group consisting of: 
immunoglobin-heavy chain, maltose binding protein, glutathione S transferase, Green 

10 Fluorescent protein, and ubiquitin. In another variation, the last nucleic acid of the 
second set of special amino acids, that is, Ihe.lasl special nucleic acid, is operably 
linked to nucleic acid polymers that code for any peptide comprising any amino acids 
from 1 to 10,000 amino acids. In still another variation, the last special nucleic acid is 
operably linked to nucleic acid polymers that code for any peptide selected from the 

1 5 group consisting of: any reporter proteins or proteins which facilitate purification. For 
example, the last special nucleic acid is operably linked to nucleic acid polymers that 
code for any peptide selected from the group consisting of: immunoglobin-heavy 
chain, maltose binding protein, glutathione S transferase. Green Fluorescent protein, 
and ubiquitin. 

20 In a related aspect, the invention provides any isolated or purified nucleic acid 

polynucleotide that codes for a protease capable of cleaving the beta secretase 
cleavage site of APP that contains two or more sets of special nucleic acids, where the 
special nucleic acids are separated by nucleic acids that code for about 100 to 300 
amino acid positions, where the amino acids in those positions may be any amino 

25 acids, where the first set of special nucleic acids consists of the nucleic acids that code 
for DIG, where the first nucleic acid of the first special set of nucleic acids is the first 
special nucleic acid, and where the second set of nucleic acids code for either DSG or 
DTG, where the last nucleic acid of the second set of special nucleic acids is the last 
special nucleic acid, where the first special nucleic acid is operably linked to nucleic 

30 acids that code for any number of amino acids from zero to 81 amino acids and where 
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each of those codons may code for any amino acid. In a preferred embodiment, the 
first special nucleic acid is operably linked to nucleic . acids that code for any number 
of from 64 to 77 amino acids where each codon may code for any amino acid. In a 
particular embodiment, the first special nucleic acid is operably linked to nucleic acids 
5 that code for 71 amino acids. For example, the first special nucleic acid is operably 
linked to 71 amino acids and where the first of those 71 amino acids is the amino acid 
T. In a preferred embodiment, the polynucleotide comprises a sequence that is at least 
95% identical to a human Aspl or Asp2 sequence as taught herein. In another 
preferred embodiment, the first special nucleic acid is operably linked to nucleic acids 

10 that code for any number of from 30 to 54 amino acids, or 35 to 47 amino acids, or 40 
to 54 amino acids where each codon may code for any amino acid. In a particular 
embodiment, the first special nucleic acid is operably linked to nucleic acids that 
code for 47 amino acids. For example, the first special nucleic acid is operably linked 
to 47 codons where the first those 47 amino acids is the amino acid E. 

15 In another related aspect, the invention provides for any isolated or purified 

nucleic acid polynucleotide that codes for a protease capable of cleaving the beta (P) 
secretase cleavage site of APP and that contains two or more sets of special nucleic 
acids, where the special nucleic acids are separated by nucleic acids that code for 
about 100 to 300 amino acid positions, where the amino acids in those positions may 

20 be any amino acids, where the first set of special nucleic acids consists of the nucleic 
acids that code for the peptide DIG, where the first nucleic acid of the first special set 
of amino acids is, the first special nucleic acid, and where the second set of special 
nucleic acids code for either the peptide DSG or DTG, where the last nucleic acid of 
the second set of special nucleic acids, the last special nucleic acid, is operably linked 

25 to nucleic acids that code for any number of codons fi*om 50 to 1 70 codons. In a 

preferred embodiment, the last special nucleic acid is operably linked to nucleic acids 
comprising from 100 to 170 codons. In a highly preferred embodiment, the last 
special nucleic acid is operably linked to nucleic acids comprising from 142 to 163 
codons. bi a particular embodiment, the last special nucleic acid is operably linked to 

30 nucleic acids comprising about 142 codons, or about 1 63 codons, or about 1 70 
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codons. In a highly preferred embodiment, the polynucleotide comprises a sequence 
that is at least 95% identical to aspartyi-protease encoding sequences taught herein. In 
one variation, the second set of special nucleic acids code for the peptide DSG. In 
another variation, the first set of nucleic acid polynucleotide is operably linked to a 
5 peptide purification tag. For example, the nucleic acid polynucleotide is operably 

linked to a peptide purification tag v^hich is six histidine. In still another variation, the 
first set of special nucleic acids are on one polynucleotide and the second set of 
special nucleic acids are on a second polynucleotide, where both first and second 
polynucleotides have at lease 50 codons. In one embodiment of this type, both of the 

1 0 polynucleotides are in the same solution. In a related aspect, the invention provides a 
vector which contains a polynucleotide as described above, or a cell or cell line which 
is transformed or transfected with a polynucleotide as described above or with a 
vector containing such a polynucleotide. 

In still another aspect, the invention provides an isolated or purified peptide or 

15 protein comprising an amino acid polymer that is a protease capable of cleaving the 

beta (P) secretase cleavage site of APP that contains two or more sets of special amino 
acids, where the special amino acids are separated by about 100 to 300 amino acid 
positions, where each amino acid position can be any amino acid, where the first set 
of special amino acids consists of the peptide DTG, where the first amino acid of the 

20 fiirst special set of amino acids is, the first special amino acid, where the second set of 
amino acids is selected fi-om the peptide comprising either DSG or DTG, where the 
last amino acid of the second set of special amino acids is the last special amino acid, 
with the proviso that the proteases disclosed in SEQ ID NO. 2 and SEQ ID NO. 4 are 
not included. In preferred embodiments, the two sets of amino acids are separated by 

25 about 1 25 to 222 amino acid positions or about 1 50 to 1 96 amino acids, or about 1 50- 
190 amino acids, or about 150 to 172 amino acids, where in each position it may be 
any amino acid. In a particular embodiment, the two sets of amino acids are separated 
by about 1 72 amino acids. For example, the protease has the amino acid sequence 
described in SEQ ID NO 6. In another particular embodiment, the two sets of amino 

30 acids are separated by about 1 96 amino acids. For example, the two sets of amino 
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acids are separated by the same amino acid sequences that separate the same set of 
special amino acids in SEQ ID NO 4. In another particular embodiment, the two sets 
of nucleotides are separated by about 1 90 amino acids. For example, the two sets of 
nucleotides are separated by the same amino acid sequences that separate the same set 
5 of special amino acids in SEQ ID NO 2. In one embodiment, the first amino acid of 
the first special set of amino acids, that is, the first special amino acid, is operably 
linked to any peptide comprising firom 1 to 10,000 amino acids. In another 
embodiment, the first special amino acid is operably linked to any peptide selected 
from the group consisting of: any reporter proteins or proteins which facilitate 

10 purification. In particular embodiments, the first special amino acid is operably linked 
to any peptide selected from the group consisting of: immunoglobin-heavy chain, 
maltose binding protein, glutathione S transferase. Green Fluorescent protein, and 
ubiquitin. In still another variation, the last amino acid of the second set of special 
amino acids, that is, the last special amino acid, is operably linked to any peptide 

15 comprising any amino acids from 1 to 10,000 amino acids. By way of nonlimiting 
example, the last special amino acid is operably linked any peptide selected from the 
group consisting of any reporter proteins or proteins which facilitate purification. In 
particular embodiments, the last special amino acid is operably linked to any peptide 
selected from the group consisting of: immunoglobin-heavy chain, maltose binding 

20 protein, glutathione S transferase. Green Fluorescent protein, and ubiquitin. 

In a related aspect, the invention provides any isolated or purified peptide or 
protein comprising an amino acid polypeptide that codes for a protease capable of 
cleaving the beta secretase cleavage site of APP that contains two or more sets of 
special amino acids, where the special amino acids are separated by about 100 to 300 

25 amino acid positions, where each amino acid in each position can be any amino acid, 
where the first set of special amino acids consists of the amino acids DTG, where the 
first amino acid of the first special set of amino acids is, the first special amino acid, 
D, and where the second set of amino acids is either DSG or DTG, where the last 
amino acid of the second set of special amino acids is the last special amino acid, G, 

30 where the first special amino acid is operably linked to amino acids that code for any 
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number of amino acids from zero to 81 amino acid positions where in each position it 
may be any amino acid. In a preferred embodiment, the first special amino acid is 
operably linked to a peptide from about 30-77 or about 64 to 77 amino acids positions 
where each amino acid position may be any amino acid. In a particular embodiment, 
5 the first special amino acid is operably linked to a peptide 35, 47, 71, or 77 amino 

acids. In a very particular embodiment, the first special amino acid is operably linked 
to 71 amino acids and the first of those 71 amino acids is the amino acid T. For 
example, the polypeptide comprises a sequence that is at least 95% identical to an 
aspartyl protease sequence as described herein. In another embodiment, the first 

1 0 special amino acid is operably linked to any number of from 40 to 54 amino acids 
(positions) where each amino acid position may be any amino acid. In a particular 
embodiment, the first special amino acid is operably linked to amino acids that code 
for a peptide of 47 amino acids, fa a very particular embodiment, the first special 
amino acid is operably linked to a 47 amino acid peptide where the first those 47 

1 S amino acids is the amino acid E. fa another particular embodiment, the first special 
amino acid is operably linked to the same corresponding peptides from SEQ ID NO. 3 
that are 35, 47, 71, or 77 peptides in length, beginning counting with the amino acids 
on the first special sequence, DTG, towards the N-terminal of SEQ ID NO. 3. fa 
another particular embodiment, the polypeptide comprises a sequence that is at least 

20 95% identical to the same corresponding amino acids in SEQ ID NO. 4, that is, 
identical to that portion of the sequences in SEQ ID NO. 4, including all the 
sequences from both the first and or the second special nucleic acids, toward the - 
terminal, through and including 71, 47, 35 amino acids before the first special amino 
acids. For example, the complete polypeptide comprises the peptide of 71 amino 

25 acids, where the first of the amino acid is T and the second is Q. 

fa still another related aspect, the invention provides any isolated or purified 
amino acid polypeptide that is a protease capable of cleaving the beta (P) secretase 
cleavage site of APP that contains two or more sets of special amino acids, where the 
special amino acids are separated by about 100 to 300 amino acid positions, where 

30 each amino acid in each position can be any amino acid, where the first set of special 
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amino acids consists of the amino acids that code for DTG, where the first amino acid 
of the first special set of amino acids is, the first special amino acid, D, and where the 
second set of amino acids are either DSG or DTG, where the last amino acid of the 
second set of special amino acids is the last special amino acid, G, which is operably 

5 linked to any number of amino acids from 50 to 1 70 amino acids, which may be any 
amino acids. In preferred embodiments, the last special amino acid is operably linked 
to a peptide of about 100 to 170 amino acids or about 142-163 amino acids. In 
particular embodiments, the last special amino acid is operably linked to a peptide of 
about 142 amino acids, or about 163 amino acids, or about 1 70 amino acids. For 

10 example, the polypeptide comprises a sequence that is at least 95% identical (and 
preferably 1 00% identical) to an aspartyl protease sequence as described herein. In 
one particular embodiment, the second set of special amino acids is comprised of the 
peptide with the amino acid sequence DSG. Optionally, the amino acid polypeptide 
is operably linked to a peptide purification tag, such as purification tag which is six 

IS histidine. In one variation, the first set of special amino acids are on one polypeptide 
and the second set of special amino acids are on a second polypeptide, where both 
first and second polypeptide have at lease 50 amino acids, which may be any amino 
acids. In one embodiment of this type, both of the polypeptides are in the same 
vessel. The invention further includes a process of making any of the polynucleotides, 

20 vectors, or cells described herein; and a process of making any of the polypeptides 
described herein. 

In yet another related aspect, the invention provides a purified polynucleotide 
comprising a nucleotide sequence that encodes a polypeptide having aspartyl protease 
activity, wherein the polypeptide has an amino acid sequence characterized by: (a) a 

25 first tripeptide sequence DTG; (b) a second tripeptide sequence selected from the 

group consisting of DSG and DTG; and (c) about 100 to 300 amino acids separating 
the first and second tripeptide sequences, wherein the polypeptide cleaves the beta 
secretase cleavage site of amyloid protein precursor. In one embodiment, the 
polypeptide comprises an amino acid sequence depicted in SEQ ID NO: 2 or 4, 

30 whereas in another embodiment, the polypeptide comprises an amino acid sequence 
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Other ihan the amino acid sequences set forth in SEQ ID NOs: 2 and 4. Similarly, the 
invention provides a purified polynucleotide comprising a nucleotide sequence that 
encodes a polypeptide that cleaves the beta secretase cleavage site of amyloid protein 
precursor; wherein the polynucleotide includes a strand that hybridizes to one or more 
5 of SEQ ID NOs: 3, 5, and 7 under the following hybridization conditions: 

hybridization overnight at 42°C for 2.5 hours in 6 X SSC/0.1% SDS, followed by 
washing in 1 .0 X SSC at 65°C, 0.1% SDS. In one embodiment, the polypeptide 
comprises an amino acid sequence depicted in SEQ ID NO: 2 or 4, whereas in another 
embodiment, the polypeptide comprises an amino acid sequence other than the amino 

1 0 acid sequences set forth in SEQ ID NOs: 2 and 4. Likewise, the invention provides a 
purified polypeptide having aspartyl protease activity, wherein the polypeptide is 
encoded by polynucleotides as described in the preceding sentences. The invention 
also provides a vector or host cell comprising such polynucleotides, and a method of 
making the polypeptides using the vectors or host cells to recombinantly express the 

15 polypeptide. 

In yet another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide, said polynucleotide encoding a Hu-Asp polypeptide and 
having a nucleotide sequence at least 95% identical to a sequence selected from the 
group consisting of: 

20 (a) a nucleotide sequence encoding a Hu-Asp polypeptide selected 

from the group consisting of Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b), wherein said 
Hu-Asp 1, Hu-Asp2(a) and Hu-Asp2(b) polypeptides have the complete amino acid 
sequence of SEQ ID NO. 2, SEQ ID NO. 4, and SEQ ID NO. 6, respectively; and 
(b) a nucleotide sequence complementary to the nucleotide 

25 sequence of (a). 

Several species are particularly contemplated. For example, the invention 
provides a nucleic acid and molecule wherein said Hu-Asp polypeptide is Hu-Aspl, 
and said polynucleotide molecule of 1 (a) comprises the nucleotide sequence of SEQ 
ID NO. 1; and a nucleic acid molecule wherein said Hu-Asp polypeptide is 

30 Hu-Asp2(a), and said polynucleotide molecule of 1 (a) comprises the nucleotide 

-10- 



wo 01/49097 



PCT/IBOl/00797 



sequence of SEQ ID NO. 4; and a nucleic acid molecule wherein said Hu-Asp 
polypeptide is Hu-Asp2(b), and said polynucleotide molecule of 1(a) comprises the 
nucleotide sequence of SEQ ID NO. 5. In addition to ihc foregoing, the invention 
provides an isolated nucleic acid molecule comprising polynucleotide which 
5 hybridizes under stringent conditions to a polynucleotide having the nucleotide 
sequence in (a) or (b) as described above. 

Additionally, the invention provides a vector comprising a nucleic acid 
molecule as described in the preceding paragraph. In a preferred embodiment, the 
nucleic acid molecule is operably linked to a promoter for the expression of a Hu-Asp 

10 polypeptide. Individual vectors which encode Hu-Asp 1, and Hu-Asp2(a), and 

Hu-Asp2(b) are all contemplated: Likewise, the invention contemplates a host cell 
comprising any of the foregoing vectors, as well as a method of obtaining a Hu-Asp 
polypeptide comprising culturing such a host cell and isolating the Hu-Asp 
polypeptide. Host cells of the invention include bacterial cells, such as £. coliy and 

1 5 eukaryotic cells. Among the eukaryolic cells that are contemplated are insect cells, 
such as sf9 or High 5 cells; and mammalian cells, such as human, rodent, lagomorph, 
and primate. Preferred human cells include HEK293, and IMR-32 cells. Other 
preferred mammalian cells include COS-7, CHO-Kl, Neuro-2A, and 3T3 cells. Also 
among the eukaryotic cells that are contemplated are a yeast cell and an avian cell. 

20 In a related aspect, the invention provides an isolated Hu-Asp 1 polypeptide 

comprising an amino acid sequence at least 95% identical to a sequence comprising 
the amino acid sequence of SEQ ID NO. 2. The invention also provides an isolated 
Hu-Asp2(a) polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence comprising the amino acid sequence of SEQ ID NO. 4. The invention also 

25 provides an isolated Hu-Asp2(a) polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence comprising the amino acid sequence of SEQ ID NO. 
8. 

in still another aspect, the invention provides an isolated antibody that binds 
specifically to any Hu-Asp polypeptide described herein, especially the polypeptide 
30 described in the preceding paragraphs. 
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The invention also provides several assays involving aspartyl protease 
enzymes of the invention. For example, the invention provides 

a method to identify a cell thai can be used to screen for inhibitors of |} 
secretase activity comprising: 
5 (a) identifying a cell that expresses a protease capable of cleaving APP at 

the P secretase site, comprising: 

i) collect the cells or the supernatant from the cells to be 

identified 

ii) measure the production of a critical peptide, where the critical 
1 0 peptide is selected from the group consisting of either the APP C-terminal peptide or 

soluble APP, 

iii) select the cells which produce the critical peptide. 

In one variation, the cells are collected and the critical peptide is the APP 
C-terminal peptide created as a result of the p secretase cleavage. In another 

15 variation, the supernatant is collected and the critical peptide is soluble APP, where 
the soluble APP has a C-terminus created by p secretase cleavage. In preferred 
embodiments, the cells contain any of the nucleic acids or polypeptides described 
above and the cells are shown to cleave the P secretase site of any peptide having the 
following peptide structure, P2, PI , PI', ?2\ where P2 is K or N, where PI is M or 

20 L, where PI ' is D, where P2' is A. The method of claim 1 1 1 where P2 is K and PI is 
M. The method of claim 1 12 where P2 is N and PI is L 

In still another aspect, the invention provides novel isofonns of amyloid 
protein precursor (APP) where the last two carboxy terminus amino acids of that 
isoform are both lysine residues. In this context, the term "isoform" is defined as any 

25 APP polypeptide, including APP variants (including mutations), and APP fragments 
that exists in humans, such as those described in US 5,766,846, col 7, lines 45-67, 
incorporated into this document by reference, modified as described herein by the 
inclusion of two C-terminal lysine residues. For example, the invention provides a 
polypeptide comprising the isoform known as APP695, modified to include two lysine 

30 residues as its last two carboxy tenninus amino acids. An exemplary polypeptide 
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comprises the amino acid sequence set forth in SEQ ID NO. 16. The invention further 
includes APP isoform variants as set forth in SEQ ID NOs. 1 8 and 20. The invention 
further includes all polynucleotides that encode an APP protein that has been modified 
to include two C-terminal lysines; as well has any eukaryotic cell line comprising such 
5 nucleic acids or polypeptides. Preferred cell lines include a mammalian cell line (e.g., 
HEK293, Neuro2a). 

Thus, in one embodiment, the invention provides a polypeptide comprising the 
amino acid sequence of a mammalian amyloid protein precursor (APP) or fragment 
thereof containing an APP cleavage site recognizable by a mammalian P-secretase, 

10 and further comprising two lysine residues at the carboxyl tenninus of the amino acid 
sequence of the mammalian APP or APP fragment. As taught herein in detail, the 
addition of two additional lysine residues to APP sequences has been found to greatly 
increase Ap processing of the APP in APP processing assays. Thus, the di-lysine 
modified APP reagents of the invention are particularly useful in assays to identify 

1 5 modulators of Ap production, for use in designing therapeutics for the treatment or 

prevention of Alzheimer's disease. In one embodiment, the polypeptide comprises the 
complete amino acid sequence of a mammalian amyloid protein precursor (APP), and 
further comprises the two lysine residues at the carboxyl terminus of the amino acid 
sequence of the manmialian amyloid protein precursor. In an alternative embodiment, 

20 the polypeptide comprises only a fragment of the APP, the fragment containing at 

least that portion of APP that is cleaved by a mammalian P-secrelase in the formation 
of Ap peptides. 

The practice of assays that monitor cleavage of APP can be facilitated by 
attaching a marker to a portion of the APP. Measurment of retained or liberated 

25 marker can be used to quantitate the amount of APP cleavage that occurs in the assay, 
e.g., in the presence or absence of a putative modulator of cleavage activity. Thus, in 
one preferred embodiment, the polypeptide of the invention further includes a marker. 
For example, the marker comprises a reporter protein amino acid sequence attached to 
the APP amino acid sequence. Exemplary reporter proteins include a fluorescing 

30 protein (e.g., green fluorescing proteins, luciferase) or an enzyme that is used to 
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cleave a substrate to produce a colorimetric cleavage product. Also contemplated are 
tag sequences which are commonly used as epitopes for quantitative immunoassays. 

In a prefened embodiment, the di-lysine-modified APP of the invention is a 
human APP. For example, human APP isoforms such as APP695, APP751 , and 
5 APP770, modified to include the two lysines, are contemplated. In a preferred 
embodiment, the APP isoform comprises at least one variation selected from the 
group consisting of a Sw^edish KM-NL mutation and a London WIM-Y mutation, or 
any other mutation that has been observed in a subpopulation that is particularly prone 
to development of Alzheimer's disease. These mutations are recognized as mutations 

10 that influence APP processing into Ap. In a highly preferred embodiment, the APP 
protein or fragment thereof comprises the APP-Sw P-secretase peptide sequence 
NLDA (SEQ ID NO: 66), which is associated with increased levels of Ap processing 
and therefore is particularly useful in assays relating to Alzheimer's research. More 
particularly, the APP protein or fragment thereof preferably comprises the APP-Sw P- 

1 5 secretase peptide sequence SEVNLDAEFR (SEQ ID NO: 63). 

In one preferred embodiment, the APP protein or fragment thereof further 
includes an APP transmembrane domain carboxy-terminal to the APP-Sw P-secretase 
peptide sequence. Polypeptides that include the TM domain are particularly useful in 
cell-based APP processing assays. In contrast, embodiments lacking the TM domain 

20 are useful in cell-free assays of APP processing. 

In addition to working with APP from humans and various animal models, 
researchers in the field of Alzheimer's also have construct chimeric APP polypeptides 
which include stretches of amino acids from APP of one species (e.g., humans) fused 
to streches of APP from one or more other species (e.g., rodent). Thus, in another 

25 embodiment of the polypeptide of the invention, the APP protein or fragment thereof 
comprises a chimeric APP, the chimeric APP including partial APP amino acid 
sequences from at least two species. A chimeric APP that includes amino acid 
sequence of a human APP and a rodent APP is particularly contemplated. 

In a related aspect, the invention provides a polynucleotide comprising a 

30 nucleotide sequence that encodes a polypeptide as described in the preceding 
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paragraphs. Such a polynucleotide is useful for recominant expression of the 
polypeptide of the invention for use in APP processing assays. In addition, the 
polynucleotide is useful for transforming into cells to produce recombinant cells that 
express the polypeptide of the invention, which cells are useful in cell-based assays to 
5 identify modulators of APP processing. Thus, in addition to polynucleotides, the 
invention provides a vector comprising such polynucleotides, especially expression 
vectors where the polynucleotide is operably linked to a promoter to promote 
expression of the polypeptide encoded by the polynucleotide in a host cell. The 
invention further provides a host cell transformed or transfected with a polynucleotide 

1 0 according to claim 1 4 or a vector according to claim 1 5 or 1 6. Among the preferred 
host cells are mammalian cells, especially human cells. 

In another, related embodiment, the invention provides a polypeptide useful 
for assaying for modulators of P-secretase activity, said polypeptide comprising an 
amino acid sequence of the formula NHj-X-Y-Z-KK-COOH; wherein X, Y, and Z 

1 5 each comprise an amino acid sequence of at least one amino acid; wherein-NH2-X 
comprises an amino-terminal amino acid sequence having at least one amino acid 
residue; wherein Y comprises an amino acid sequence of a p-secretase recognition site 
of a mammalian amyloid protein precursor (APP); and wherein Z-KK-COOH 
comprises a carboxy-terminal amino acid sequence ending in two lysine (K) residues. 

20 In one preferred variation, the carboxyl-terminal amino acid sequence Z includes a 
hyrdrophobic domain that is a transmembrane domain in host cells that express the 
polypeptide. Host cells that express such a polypeptide are particularly useful in 
assays described herein for identifying modulators of APP processing. In another 
preferred variation, the amino-terminal amino acid sequence X includes an amino acid 

25 sequence of a reporter or marker protein, as described above. In still another preferred 
variation, the p-secretase recognition site Y comprises the human APP-Sw P-secretase 
peptide sequence NLDA (SEQ ID NO: 66). It will be apparent that these preferred 
variations are not mutually exclusive of each other - they may be combined in a 
single polypeptide. The invention further provides a polynucleotide comprising a 

30 nucleotide sequence that encodes such polypeptides, vectors which comprise such 
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polynucleotides, and host cells which comprises such vectors, polynucleotides, and/or 
polypeptides. 

In yet another aspect, the invention provides a method for identifying 
inhibitors of an enzyme that cleaves the beta secretase cleavable site of APP 
5 comprising: 

a) culturing cells in a culture medium under conditions in which the 
enzyme causes processing of APP and release of amyloid beta-peptide into the 
medium and causes the accumulation of CTF99 fragments of APP in cell lysates, 

b) exposing the cultured cells to a test compound; and specifically 
1 0 determining whether the test compound inhibits the function of the enzyme by 

measuring the amount of amyloid beta-peptid.e released into the medium and/or the 
amount of CTF99 fragments of APP in cell lysates; 

c) identifying test compounds diminishing the amount of soluble amyloid 
beta peptide present in the culture medium and diminution of CTF99 fragments of 

15 APP in cell lysates as Asp2 inhibitors. In preferred embodiments, the cultured cells 
are a human, rodent or insect cell line. It is also preferred that the human or rodent 
cell line exhibits P secretase activity in which processing of APP occurs with release 
of amyloid beta-peptide into the culture medium and accumulation of CTF99 in cell 
lysates. Among the contemplated test compounds are antisense oligomers directed 

20 against the enzyme that exhibits P secretase activity, which oligomers reduce release 
of soluble amyloid beta-peptide into the culture medium and accumulation of CTF99 
in cell lysates. 

In yet another aspect, the invention provides a method for the identification of 
an agent that decreases the activity of a Hu-Asp polypeptide selected from the group 
25 consisting of Hu-Aspl , Hu-Asp2(a), and Hu-Asp2(b), the method comprising: 

a) determining the activity of said Hu-Asp polypeptide in the presence of 
a test agent and in the absence of a test agent; and 

b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide determined in 

30 the absence of said test agent; whereby a lower level of activity in the presence of said 
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test agent than in the absence of said test agent indicates that said test agent has 
decreased the activity of said Hu-Asp polypeptide. 

In a related aspect, the invention provides a method for assaying for 
modulators of p-secretase activity, comprising the steps of: 
5 (a) contacting a first composition with a second composition both in the 

presence and in the absence of a putative modulator compound, wherein the first 
composition comprises a mammalian p-secretase polypeptide or biologically active 
fi^agment thereof, and wherein the second composition comprises a substrate 
polypeptide having an amino acid sequence comprising a P-secretase cleavage site; 

1 0 (b) measuring cleavage of the substrate polypeptide in the presence and in the absence 
of the putative modulator compoimd; and (c) identifying modulators of P-secretase 
activity fi-om a difference in cleavage in the presence versus in the absence of the 
putative modulator compound. A modulator that is a P-secretase antagonist 
(inhibitor) reduces such cleavage, whereas a modulator that is a p-secretase agonist 

1 5 increases such cleavage. Since such assays are relevant to development of 

Alzheimer's disease therapeutics for humans, it will be readily apparent that, in one 
preferred embodiment, the first composition comprises a purified human Asp2 
polypeptide. In one variation, the first composition comprises a soluble fragment of a 
human Asp2 polypeptide that retains Asp2 P-secretase activity. Several such 

20 fi-agments (including ATM fi-agments) are described herein in detail. Thus, in a 
particular embodiment, the soluble fi-agment is a fi-agment lacking an Asp2 
transmembrane domain. 

The p-secretase cleavage site in APP is known, and it will be 
appreciated that the oassays of the invention can be performed with either intact APP 

25 or fragments or analogs of APP that retain the p-secretase recognition and cleavage 
site. Thus, in one variation, the substrate polypeptide of the second composition 
comprises the amino acid sequence SEVNLDAEFR (SEQ ID NO: 63), which 
includes the P-secretase recognition site of human APP that contains the "Swiss" 
mutation. In another variation, the substrate polypeptide of the second composition 

30 comprises the amino acid sequence EVKMDAEF (SEQ ID NO: 67). In another 
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variation, the second composition comprises a polypeptide having an amino acid 
sequence of a human amyloid precursor protein (APP). For example, the human 
amyloid precursor protein is selected from the group consisting of: APP695, APP75 1 , 
and APP770. Preferably, the human amyloid precursor protein (irrespective of 
5 isoform selected) includes at least on mutation selected from a KM-NL Swiss 
mutation and a V-F London mutation. As explained elsewhere, one preferred 
embodiment involves a variation wherein the polypeptide having an amino acid 
sequence of a human APP further comprises an amino acid sequence comprising a 
marker sequence attached amino-terminal to the amino acid sequence of the human 

10 amyloid precursor protein. Preferably, the polypeptide having an amino acid sequence 
of a human APP further comprises two lysine residues attached to the carboxyl 
terminus of the amino acid sequence of the human APP. The assays can be performed 
in a cell free setting, using cell-free enzyme and cell-free substrate, or can be 
performed in a cell-based assay wherein the second composition comprises a 

1 5 eukaryotic cell that expresses amyloid precursor protein (APP) or a fragment thereof 
containing a (J-secretase cleavage site. Preferably, the APP expressed by the host cell 
is an APP variant that includes two carboxyl-terminal lysine residues. It will also be 
appreciated that the P-secretase enzyme can be an enzyme that is expressed on the 
surface of the same cells. 

20 The present invention provides isolated nucleic acid molecules comprising a 

polynucleotide that codes for a polypeptide selected from the group consisting of 
human aspartyl proteases. In particular, human aspartyl protease 1 (Hu-Aspl) and 
two alternative splice variants of human aspartyl protease-2 (Hu-Asp2), a "long" (L) 
form designated herein as Hu-Asp2(a) and a "short" (S) form designated Hu-Asp2(b). 

25 As used herein, all references to "Hu-Asp" should be understood to refer to all of 

Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b). In addition, as used herein, all references to 
"Hu-Asp2" should be understood to refer to both Hu-Asp2(a) and Hu-Asp2(b). 
Hu-Aspl is expressed most abundantly in pancreas and prostate tissues, while 
Hu-Asp2(a) and Hu-Asp2{b) are expressed most abundantly in pancreas and brain 
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tissues. The invention also provides isolated Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b) 
polypeptides, as welJ as fragments thereof which exhibit aspartyl protease activity. 

In a preferred embodiment, the nucleic acid molecules comprise a polynucleotide 
having a nucleotide sequence selected from the group consisting of residues 1-1554 of 

5 SEQ ED NO. 1, encoding Hu-Aspl, residues 1-1503 of SEQ ID NO. 3, encoding 
Hu-Asp2(a), and residues 1-1428 of SEQ ID N0.5, encoding Hu-Asp2(b). In another 
aspect, the invention provides an isolated nucleic acid molecule comprising a 
polynucleotide which hybridizes under stringent conditions to a polynucleotide encoding 
Hu-Aspl , Hu- Asp2(a), Hu-Asp-2(b), or fragments thereof. European patent application 

10 EP 0 848 062 discloses a polypeptide referred to as "Asp 1,*' that bears substantial 
homology to Hu-Aspl, while international application WO 98/22597 discloses a 
polypeptide referred to as "Asp 2," that bears substantial homology to Hu- Asp2(a). 

The present invention also provides vectors comprising the isolated nucleic acid 
molecules of the invention, host cells into which such vectors have been introduced, and 

1 5 recombinant methods of obtaining a Hu-Aspl , Hu-Asp2(a), or Hu-Asp2(b) polypeptide 
comprising culturing the above-described host cell and isolating the relevant polypeptide. 

In another aspect, the invention provides isolated Hu-Aspl, Hu-Asp2(a), and 
Hu-Asp2(b) polypeptides, as well as fragments thereof. In a preferred embodiment, the 
Hu-Aspl , Hu-Asp2{a), and Hu-Asp2(b) polypeptides have the amino acid sequence given 

20 in SEQ ID NO. 2, SEQ ID NO. 4, or SEQ ID N0.6, respectively. The present invention 
also describes active forms of Hu-Asp2, methods for preparing such active forms, 
methods for preparing soluble forms, methods for measuring Hu-Asp2 activity, and 
substrates for Hu-Asp2 cleavage. The invention also describes antisense oligomers 
targeting the Hu-Aspl, Hu-Asp2(a) and Hu-Asp2(b) mRNA transcripts and the use of 

25 such antisense reagents to decrease such mRNA and consequently the production of the 
corresponding polypeptide. Isolated antibodies, both polyclonal and monoclonal, that 
binds specifically to any of the Hu-Aspl , Hu-Asp2(a), and Hu-Asp2(b) polypeptides of 
the invention are also provided. 

The invention also provides a method for the identification of an agent that 

30 modulates the activity of any of Hu- Asp- 1 , Hu- Asp2(a), and Hu-Asp2(b). The inventions 
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describes methods to test such agents in cell-free assays to which Hu-Asp2 polypeptide 
is added, as well as methods to test such agents in human or other mammalian cells in 
which Hu-Asp2 is present. 

Additional features and variations of the invention will be apparent to 
5 those skilled in the art from the entirety of this application, including the drawing and 
detailed description, and all such features are intended as aspects of the invention. 
Likewise, features of the invention described herein can be re-combined into additional 
embodiments that are also intended as aspects of the invention, irrespective of whether 
the combination of features is specifically mentioned above as an aspect or embodiment 

1 0 of the invention. Also, only such limitations which are described herein as critical to the 
invention should be viewed as such; variations of the invention lacking limitations which 
have not been described herein as critical are intended as aspects of the invention. 

In addition to the foregoing, the invention includes, as an additional 
aspect, all embodiments of the invention narrower in scope in any way than the variations 

15 specifically mentioned above. Although the applicant(s) invented the full scope of the 
claims appended hereto, the claims appended hereto are not intended to encompass 
within their scope the prior art work of others. Therefore, in the event that statutory prior 
art within the scope of a claim is brought to the attention of the applicants by a Patent 
Office or other entity or individual, the applicant(s) reserve the right to exercise 

20 amendment rights under applicable patent laws to redefine the subject matter of such a 
claim to specifically exclude such statutory prior art or obvious variations of statutory 
prior art from the scope of such a claim. Variations of the invention defined by such 
amended claims also are intended as aspects of the invention. 



25 BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

Sequence ID No. 1 : Human Asp- 1 , nucleotide sequence. 

Sequence ID No. 2: Human Asp-1, predicted amino acid sequence. 

Sequence ID No. 3: Human Asp-2{a), nucleotide sequence. 

Sequence ID No. 4: Human Asp-2(a), predicted amino acid sequence. The 
30 Asp2(a) amino acid sequence includes a putative signal peptide comprising residues 1 
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lo 21 ; and a putative pre-propeplide after the signal peptide that extends through 
residue 45 (as assessed by processing observed of recombinant Asp2(a) in CHO cells), 
and a putative propeptide that may extend lo at least about residue 57, based on the 
observation of an observed GRR IGS (SEQ ID NO: 68) sequence which has 
5 characteristics of a protease recognition sequence. The Asp2(a) further includes a 
transmembrane domain comprising residues 455-477, a cytoplasmic domain 
comprising residues 478-501 , and a putative alpha-helical spacer region, comprising 
residues 420-454, believed to be unnecessary for proteolytic activity, behveen the 
protease catalytic domain and the transmembrane domain. 

10 Sequence ID No. 5: Human Asp-2(b), nucleotide sequence. 

Sequence ID No. 6: Human Asp-2(b), predicted amino acid sequence. The 
Asp2(b) amino acid sequence includes a putative signal peptide, pre-propeptide, and 
propeptide as described above for Asp2(a). The Asp2(b) further includes a 
transmembrane domain comprising residues 430-452, a cytoplasmic domain 

1 5 comprising residues 453-476, and a putative alpha-helical spacer region, comprising 
residues 395-429, believed to be unnecessary for proteolytic activity, between the 
protease catalytic domain and the transmembrane domain. 

Sequence ID No. 7: Murine Asp-2(a), nucleotide sequence. 

Sequence ID No. 8: Murine Asp-2(a), predicted amino acid sequence. The 

20 proteolytic processing of murine Asp2(a) is believed to be analogous to the processing 
described above for human Asp2(a). In addition, a variant lacking amino acid 
residues 190-214 of SEQ ID NO: 8 is specifically contemplated as a murine Asp2(b) 
polypeptide. 

Sequence ID No. 9: Human APP695, nucleotide sequence. 
25 Sequence ID No. 1 0: Human APP695, predicted amino acid sequence. 

Sequence ID No. 1 1 : Human APP695-Sw, nucleotide sequence. 
Sequence ID No. 12: Human APP695-Sw. predicted amino acid sequence. In 
the APP695 isoform, the Sw mutation is characterized by a KM-NL alteration at 
positions 595-596 (compared to normal APP695). 
30 Sequence ID No. 1 3 : Human APP695- VF, nucleotide sequence. 
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Sequence ID No. 14: Human APP695-VF, predicted amino acid sequence. In 

the APP 695 isofomi, the VF mutation is characterized by a V-F aheraiion at position 

642 (compared to normal APP 695). 

Sequence ID No. 15: Human APP695-KK, nucleotide sequence. 
5 Sequence ID No. 16: Human APP695-KK, predicted amino acid sequence. 

(APP695 with two carboxy-terminal lysine residues.) 

Sequence ID No. 1 7: Human APP695-Sw-KK, nucleotide sequence. 
Sequence ED No.l 8: Human APP695-Sw-KK, predicted amino acid sequence 
Sequence ID No. 1 9: Human APP695- VF-KK, nucleotide sequence 
1 0 Sequence ID No.20: Human APP695-VF-KK, predicted amino acid sequence 

Sequence ED No.21 : T7-Human-pro-Asp-2(a)ATM, nucleotide sequence 
Sequence ID No.22: T7-Human-pro-Asp-2(a)ATM, amino acid sequence 
Sequence ID No.23: T7-Caspase-Human-pro-Asp-2(a)ATM, nucleotide 

sequence 

1 5 Sequence ID No.24: T7-Caspase-Human-pro-Asp-2(a)ATM, amino acid 

sequence 

Sequence ID No.25: Human-pro-Asp-2(a)ATM (low GC), nucleotide 
sequence 

Sequence ID No.26: Human-pro-Asp-2(a)ATM, (low GC), amino acid 

20 sequence 

Sequence ID No.27: T7-Caspase-Caspase 8 
cleavage-Human-pro-Asp-2(a)ATM, nucleotide sequence 

Sequence ID No.28: T7-Caspase-Caspase 8 
cleavage-Human-pro-Asp-2(a)ATM, amino acid sequence 

25 Sequence ID No.29: Human Asp-2(a)ATM, nucleotide sequence 

Sequence ID No.30: Human Asp-2(a)ATM, amino acid sequence 
Sequence ID No.31 : Human Asp-2(a)ATM(His)(, , nucleotide sequence 
Sequence ID No. 32: Human Asp-2(a)ATM(His)t,, amino acid sequence 
Sequence ID Nos. 33-49 are short synthetic peptide and oligonucleotide 

30 sequences that are described below in the Detailed Description of the Invention. 
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Sequence ID No. 50: Human Asp2(b)ATM polynucleotide sequence. 

Sequence ID No. 51 : Human Asp2(b)ATM polypeptide sequence (exemplary 
variant of Human Asp2(b) lacking transmembrane and intracellular domains of Hu- 
Asp2(b) set forth in SEQ ID NO: 6. 
5 Sequence ID No. 52: Human Asp2(b)ATM(His)^ polynucleotide sequence. 

Sequence ID No. 53: Human Asp2(b)ATM(His)(^ polypeptide sequence 
(Human Asp2(b)ATM with six histidine lag attached to C-terminus) 

Sequence ID No. 54: Human APP770-encoding polynucleotide sequence. 

Sequence ID No. 55: Human APP770 polypeptide sequence. To introduce the 
10 KM-NL Swedish mutation, residues KM at positions 670-71 are changed to NL. To 
introduce the V-F London mutation, the V residue at position 717 is changed to F. 

Sequence ID No. 56: Human APP751 encoding polynucleotide sequence. 

Sequence ID No. 57: Human APP75I polypeptide sequence (Human APP751 
isoform). 

15 Sequence ID No. 58: Human APP770-KK encoding polynucleotide sequence. 

Sequence ID No. 59: Human APP770-KK polypeptide sequence. (Human 
APP770 isoform to which two C-lerminal lysines have been added). 

Sequence ID No. 60: Human APP751-KK encoding polynucleotide sequence. 
Sequence ID No. 61 : Human APP75 1 -KK polypeptide sequence (Human 
20 APP751 isoform to which two C-terminal lysines have been added). 

Sequence ID No. 62-65: Various short peptide sequences described in detail 
in detailed description. 



BRJEF DESCRIPTION OF THE FIGURES 

25 Figure 1 : Figure 1 shows the nucleotide (SEQ ID NO: 1 ) and predicted amino 

acid sequence (SEQ ID N0:2) of human Aspl . 

Figure2: Figure 2 shows the nucleotide (SEQ ID N0:3) and predicted amino 
acid sequence (SEQ ID N0:4) of human Asp2(a). 
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Figure 3: Figure 3 shows the nucleotide (SEQ IDN0:5) and predicted amino 
acid sequence (SEQ ID N0:6) of human Asp2(b). The predicted transmembrane domain 
of Hu-Asp2(b) is enclosed in brackets. 

Figure 4: Figure 4 shows the nucleotide (SEQ ID No. 7) and predicted amino 
5 acid sequence (SEQ ID No. 8) of murine Asp2(a) 

Figure 5: Figure 5 shows the BestFit alignment of the predicted amino acid 
sequences of Hu-Asp2(a) (SEQ ID NO: 4) and murine Asp2(a) (SEQ ID NO: 8). 

Figure 6: Figure 6 shows the nucleotide (SEQ ED No. 21) and predicted 
amino acid sequence (SEQ ED No. 22) of T7-Human-pro-Asp-2(a)ATM 
10 Figure 7: Figure 7 shows the nucleotide (SEQ ID No. 23) and predicted 

amino acid sequence (SEQ ID No. 24) of T7-caspase-Human-pro-Asp-2(a)ATM 

Figure 8: Figure 8 shows the nucleotide (SEQ ED No. 25) and predicted 
amino acid sequence (SEQ ID No. 26) of Human-pro- Asp-2(a) ATM (low GC) 

Figure 9: Western blot showing reduction of CTF99 production by 
1 3 HEK 1 25.3 cells transfected with antisense oligomers targeting the Hu-Asp2 mRNA. 

Figure 10: Western blot showing increase in CTF99 production in mouse 
Neuro-2a cells cotransfected with APP-KK with and without Hu-Asp2 only in those cells 
cotransfected with Hu-Asp2. A further increase in CTF99 production is seen in cells 
cotransfected with APP-Sw-KK with and without Hu-Asp2 only in those cells 
20 cotransfected with Hu-Asp2 

Figure 1 1 : Figure 1 1 shows the predicted amino acid sequence (SEQ ID No. 
30) of Human-Asp2(a)ATM 

Figure 12: Figure 1 1 shows the predicted amino acid sequence (SEQ ID No. 
30) of Human-Asp2(a)ATM(His), 

25 

DETAILED DESCRIPTION OF THE INVENTION 

A few definitions used in this invention follow, most definitions to be used are 
those that would be used by one ordinarily skilled in the art. 

The term "P amyloid peptide" means any peptide resulting firom beta secretase 
30 cleavage of APP. This includes peptides of 39, 40, 4 1 , 42 and 43 amino acids, extending 



-24- 



wo 01/49097 



PCT/IBOl/00797 



from the P-secretase cleavage site to 39, 40, 41 , 42 and 43 amino acids C- terminal to the 
P-secretase cleavage site. P amyloid peptide also includes sequences 1 -6, SEQ ID NOs. 
1 -6 of US 5,750,349, issued 1 2 May 1 998 (incorporated into this document by reference). 
A P-secretase cleavage fragment disclosed here is called CTF-99, which extends from 
5 p-secretase cleavage site to the carboxy terminus of APP. 

When an isoform of APP is discussed then what is meant is any APP 
polypeptide, including APP variants (including mutations), and APP fragments that 
exists in humans such as those described in US. 5,766,846, col 7, lines 45-67, 
incorporated into this document by reference. 

1 0 The term "P-amyloid precursor protein" (APP) as used herein is defined as a 

polypeptide that is encoded by a gene of the same name localized in humans on the 
long ami of chromosome 21 and that includes "PAP - here "p-amyloid protein" see 
above, within its carboxyl third. APP is a glycosylated, single-membrane spanning 
protein expressed in a wide variety of cells in many mammalian tissues. Examples of 

1 5 . specific isotypes of APP which are currently known to exist in humans are the 695 

amino acid polypeptide described by Kang et. al (1987) Nature 325:733-736 which is 
designated as the "normal" APP (SEQ ID NOs: 9-10); the 751 amino acid polypeptide 
described by Ponte et al. (1988) Nature 331:525-527 (1988) and Tanzi el al. (1988) 
Nature 331:528-530 (SEQ ID NOs: 56-57); and the 770-amino acid polypeptide 

20 described by Kitaguchi et. al. (1988) Nature 33 1 :530-532 (SEQ ID NOs: 54-55). 

Examples of specific variants of APP include point mutation which can differ in both 
position and phenotype (for review of known variant mutation see Hardy (1992) 
Nature Genet. 1:233-234). All references cited here incorporated by reference. The 
term "APP fragments" as used herein refers to fragments of APP other than those 

25 which consist solely of PAP or pAP fragments. That is, APP fragments will include 
amino acid sequences of APP in addition to those which form intact PAP or a 
Augment of pAP. 

When the term "any amino acid" is used^ the amino acids referred to are to be 
selected from the following, three letter and single letter abbreviations - which may 
30 also be used, are provided as follows: 
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Alanine, Ala, A; Arginine, Arg, R; Asparagine, Asn, N; Aspartic acid, Asp, 
D; Cysteine, Cys, C; Glutamine, Gin, Q; Glutamic Acid, Glu, E; Glycine, Gly, G; 
Histidine, His, H; Isoleucine, Jle, I; Leucine, Leu, L; Lysine, Lys, K; Methionine, 
Met, M; Phenylalanine, Phe, F; Proline, Pro, P; Serine, Ser, S; Threonine, Thr, T; 
5 Tryptophan, Trp, W; Tyrosine, Tyr, Y; Valine, Val, V; Aspartic acid or Asparagine, 
Asx, B; Glutamic acid or Glutamine, Glx, Z; Any amino acid, Xaa, X. 

The present invention describes a method to scan gene databases for the 
simple active site motif characteristic of aspartyl proteases. Eukaryotic aspartyl 
proteases such as pepsin and renin possess a two-domain structure which folds to 

1 0 bring two aspartyl residues into proximity within the active site. These are embedded 
in the short tripeptide motif DTG, or more rarely, DSG. Most aspartyl proteases occur 
as proenzyme whose N-terminus must be cleaved for activation. The DTG or DSG 
active site motif appears at about residue 65-70 in the proenzyme (prorenin, 
pepsinogen), but at about residue 25-30 in the active enzyme after cleavage of the 

1 5 N-terminal prodomain. The limited length of the active site motif makes it difficult to 
search collections of short, expressed sequence tags (EST) for novel aspartyl 
proteases. EST sequences typically average 250 nucleotides or less, and so would 
encode 80-90 amino acid residues or less. That would be too short a sequence to span 
the two active site motifs. The preferred method is to scan databases of hypothetical 

20 or assembled protein coding sequences. The present invention describes a computer 
method to identify candidate aspartyl proteases in protein sequence databases. The 
method was used to identify seven candidate aspartyl protease sequences in the 
Caenorhabditis elegans genome. These sequences were then used to identify by 
homology search Hu-Aspl and two alternative splice variants of Hu-Asp2, designated 

25 herein as Hu-Asp2(a) and Hu-Asp2(b). 

In a major aspect of the invention disclosed here we provide new information 
about APP processing. Pathogeneic processing of the amyloid precursor protein 
(APP) via the Ap pathway requires the sequential action of two proteases referred to 
as p-secretase and y-secretase. Cleavage of APP by the P-secretase and y-secretase 

30 generates the N-terminus and C-terminus of the Ap peptide, respectively. Because 
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over production of the AP peptide, particularly the AP,^^* has been implicated in the 
initiation of Alzheimer's disease, inhibitors of either the p-secretase and/or the 
Y-secretase have potential in the treatment of Alzheimer's disease. Despite the 
importance of the P-secretase and y-secretase in the pathogenic processing of APP, 
5 molecular definition of these enzymes has not been accomplished to date. That is, it 
was not known what enzymes were required for cleavage at either the P-secretase or 
the ysecretase cleavage site. The sites themselves were known because APP was 
known and the Ap,.42, peptide was known, see US 5,766,846 and US 5,837,672, 
(incorporated by reference, with the exception to reference to "soluble" peptides). But 

10 what enzyme was involved in producing the Ap,^2» peptide was unknown. 

Alignment of the amino acid sequences of Hu-Asp2 with other known aspartyl 
proteases reveals a similar domain organization. All of the sequences contain a signal 
sequence followed by a pro-segment and the catalytic domain containing 2 copies of 
the aspartyl protease active site motif (DTG/DSG) separated by approximately 180 

15 amino acid residues. Comparison of the processing site for proteolytic removal of the 
pro-segment in the mature forms of pepsin A, pepsin C, cathepsin D, cathepsin E and 
renin reveals that the mature forms of these enzymes contain between 31-35 amino 
acid residues upstream of the first DTG motif. Inspection of this region in the 
Hu-Asp-2 amino acid sequence indicates a preferred processing site within the 

20 sequence GRR i GS (SEQ ID NO: 68) as proteolytic processing of pro-protein 

precursors commonly occurs at site following dibasic amino acid pairs {eg. RR). 
Also, processing at this site would yield a mature enzyme with 35 amino acid residues 
upstream of the first DTG, consistent with the processing sites for other aspartyl 
proteases. In the absence of self-activation of Hu-Asp2 or a knowledge of the 

25 endogenous protease that processes Hu-Asp2 at this site, a recombinant form was 
engineered by introducing a recognition site for the PreSission protease 
(LEVLFQlGP; SEQ ID NO: 62) into the expression plasmids for bacterial, insect 
cell, and mammalian cell expression of pro-Hu-Asp2. In each case, the Gly residue in 
PI ' position corresponds to the Gly residue 35 amino acids upstream of the first DTG 

30 motif in Hu-Asp2. 
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The present invention involves the molecular definition of several novel 
human aspartyl proteases and one of these, referred to as Hu-Asp-2(a) and 
Hu-Asp2(b), has been characterized in detail. Previous forms of aspl and asp 2 have 
been disclosed, see EP 0848062 A2 and EP 0855444A2, inventors David Powel et al., 
5 assigned to Smith Kline Beecham Corp. (incorporated by reference). Herein are 
disclosed old and new forms of Hu-Asp 2. For the first time they are expressed in 
active form, their substrates are disclosed, and their specificity is disclosed. Prior to 
this disclosure cell or cell extracts were required to cleave the p-secretase site, now 
purified protein can be used in assays, also described here. Based on the results of (1) 

10 antisense knock out experiments, (2) transient transfection knock in experiments, and 
(3) biochemical experiments using purified recombinant Hu-Asp-2, we demonstrate 
that Hu-Asp-2 is the P-secretase involved in the processing of APP. Although the 
nucleotide and predicted amino acid sequence of Hu-Asp-2(a) has been reported, see 
above, see EP 0848062 A2 and EP 0855444 A2, no functional characterization of the 

15 enzyme was disclosed. Here the authors characterize the Hu-Asp-2 enzyme and are 
able to explain why it is a critical and essential enzyme required in the formation of 
AP|^2» peptide and possible a critical step in the development of AD. 

In another embodiment the present invention also describes a novel splice 
variant of Hu-Asp2, referred to as Hu-Asp-2(b), that has never before been disclosed. 

20 In another embodiment, the invention provides isolated nucleic acid molecules 

comprising a polynucleotide encoding a polypeptide selected from the group 
consisting of human aspartyl protease 1 (Hu-Asp I) and two alternative splice variants 
of human aspartyl protease-2 (Hu-Asp2), designated herein as Hu-Asp2(a) and 
Hu-Asp2(b). As used herein, all references to "Hu-Asp2" should be understood to 

25 refer to both Hu-Asp2(a) and Hu-Asp2(b). Hu-Aspl is expressed most abundantly in 
pancreas and prostate tissues, while Hu-Asp2(a) and Hu-Asp2(b) are expressed most 
abundantly in pancreas and brain tissues. The invention also provides isolated 
Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b) polypeptides, as well as fragments thereof 
which exhibit aspartyl protease activity. 
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The predicted amino acid sequences of Hu-Aspl, Hu-Asp2(a) and Hu-Asp2(b) 
share significant homology with previously identified mammalian aspartyl proteases 
such as pepsinogen A, pepsinogen B, cathepsin D, calhepsin E, and renin. P.B.Szecs, 
Scand, J. Clin. Lab. Invest, i2:(Suppl. 210 5-22 (1992)). These enzymes are 
5 characterized by the presence of a duplicated DTG/DSG sequence motif. The 
Hu-Aspl and HuAsp2 polypeptides disclosed herein also exhibit extremely high 
homology with the ProSite consensus motif for aspartyl proteases extracted from the 
SwissProt database. 

The nucleotide sequence given as residues 1-1554 of SEQ ID NO: 1 

10 corresponds to the nucleotide sequence encoding Hu-Aspl , the nucleotide sequence 
given as residues 1-1503 of SEQ ID N0:3 corresponds to the nucleotide sequence 
encoding Hu-Asp2(a), and the nucleotide sequence given as residues 1-1428 of SEQ 
ID N0:5 corresponds to the nucleotide sequence encoding Hu-Asp2(b). The isolation 
and sequencing of DNA encoding Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b) is 

1 5 described below in Examples 1 and 2. 

As is described in Examples 1 and 2, automated sequencing methods were 
used to obtain the nucleotide sequence of Hu-Aspl , Hu-Asp2(a), and Hu-Asp-2(b). 
The Hu-Asp nucleotide sequences of the present invention were obtained for both 
DNA strands, and are believed to be 1 00% accurate. However, as is known in the art, 

20 nucleotide sequence obtained by such automated methods may contain some errors. 
Nucleotide sequences determined by automation are typically at least about 90%, 
more typically at least about 95% to at least about 99.9% identical to the actual 
nucleotide sequence of a given nucleic acid molecule. The actual sequence may be 
more precisely determined using manual sequencing methods, which are well known 

25 in the art. An error in sequence which results in an insertion or deletion of one or 
more nucleotides may result in a frame shift in translation such that the predicted 
amino acid sequence will differ from that which would be predicted from the actual 
nucleotide sequence of the nucleic acid molecule, starting at the point of the mutation. 
The Hu-Asp DNA of the present invention includes cDNA, chemically synthesized 

30 DNA, DNA isolated by PCR, genomic DNA, and combinations thereof. Genomic 
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Hu-Asp DNA may be obtained by screening a genomic library with the Hu-Asp2 
cDNA described herein, using methods that are well known in the art, or with 
oligonucleotides chosen from the Hu-Asp2 sequence that will prime the polymerase 
chain reaction (PCR). RNA transcribed from Hu-Asp DNA is also encompassed by 
5 the present invention. 

Due to the degeneracy of the genetic code, two DNA sequences may differ and 
yet encode identical amino acid sequences. The present invention thus provides 
isolated nucleic acid molecules having a polynucleotide sequence encoding any of the 
Hu-Asp polypeptides of the invention, wherein said polynucleotide sequence encodes 

10 a Hu-Asp polypeptide having the complete amino acid sequence of SEQ ID N0:2, 
SEQ ID N0:4, SEQ ID N0:6, or fragments thereof 

Also provided herein are purified Hu-Asp polypeptides, both recombinant and 
non-recombinant. Most importantly, methods to produce Hu-Asp2 polypeptides in 
active form are provided. These include production of Hu-Asp2 polypeptides and 

15 variants thereof in bacterial cells, insect cells, and mammalian cells, also in forms that 
allow secretion of the Hu-Asp2 polypeptide from bacterial, insect or mammalian cells 
into the culture medium, also methods to produce variants of Hu-Asp2 polypeptide 
incorporating amino acid tags that facilitate subsequent purification. In a preferred 
embodiment of the invention the Hu-Asp2 polypeptide is converted to a 

20 proteolytically active form either in transformed cells or after purification and 

cleavage by a second protease in a cell-free system, such active forms of the Hu-Asp2 
polypeptide beginning with the N-terminal sequence TQHGIR (SEQ ID NO: 69) or 
ETDEEP (SEQ ID NO: 70). The sequence TQHGIR (SEQ DD NO: 69) represents the 
amino-terminus of Asp2(a) or Asp2(b) beginning with residue 22 of SEQ ID NO: 4 or 

25 6, after cleavage of a putative 21 residue signal peptide. Recombinant Asp2(a) 

expressed in and purified from insect cells was observed to have this amino terminus, 
presumably as a result of cleavage by a signal peptidase. The sequence ETDEEP 
(SEQ ID NO: 70) represents the amino-terminus of Asp2(a) or Asp2(b) beginning 
with residue 46 of SEQ ID NO: 4 or 6, as observed when Asp2(a) has been 

30 recombinantly produced in CHO cells (presumably after cleavage by both a rodent 
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signal peptidase and another rodent peptidase thai removes a propeptide sequence). 
The Asp2(a) produced in the CHO cells possesses P-secretase activity, as described in 
greater detail in Examples 1 1 and 12. Variants and derivatives, including fragments, 
of Hu-Asp proteins having the native amino acid sequences given in SEQ ID Nos: 2, 
5 4, and 6 that retain any of the biological activities of Hu-Asp are also within the scope 
of the present invention. Of course, one of ordinary skill in the art will readily be able 
to determine whether a variant, derivative, or fragment of a Hu-Asp protein displays 
Hu-Asp activity by subjecting the variant, derivative, or fragment to a standard 
aspartyl protease assay. Fragments of Hu-Asp within the scope of this invention 

10 include those that contain the active site domain containing the amino acid sequence 
DTG, fragments that contain the active site domain amino acid sequence DSG, 
fragments containing both the DTG and DSG active site sequences, fragments in 
which the spacing of the DTG and DSG active site sequences has been lengthened, 
fragments in which the spacing has been shortened. Also within the scope of the 

1 5 invention are fragments of Hu-Asp in which the transmembrane domain has been 

removed to allow production of Hu-Asp2 in a soluble form. In another embodiment of 
the invention, the two halves of Hu-Asp2, each containing a single active site DTG or 
DSG sequence can be produced independently as recombinant polypeptides, then 
combined in solution where they reconstitute an active protease. 

20 Thus, the invention provides a purified polypeptide comprising a fragment of a 

mammalian Asp2 protein, wherein said fragment lacks the Asp2 transmembrane 
domain of said Asp2 protein, and wherein the polypeptide and the fragment retain the 
P-secretase activity of said mammalian Asp2 protein. In a preferred embodiment, the 
purified polypeptide comprises a fragment of a human Asp2 protein that retains the P- 

25 secretase activity of the human Asp2 protein from which it was derived. Examples 
include: 

a purified polypeptide that comprises a fragment of Asp2(a) having the 
amino acid sequence set forth in SEQ ID NO: 4, wherein the polypeptide lacks 
transmembrane domain amino acids 455 to 477 of SEQ ID NO: 4; 
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a purified polypeptide as described in the preceding paragraph that 
further lacks cytoplasmic domain amino acids 478 to 501 of SEQ ID NO: 4; 

a purified polypeptide as described in either of the preceding 
paragraphs that further lacks amino acids 420-454 of SEQ ID NO: 4, which 
5 constitute a putative alpha helical region between the catalytic domain and the 

transmembrane domain that is believed to be unnecessary for (J-secretase 
activity; 

a purified polypeptide that comprises an amino acid sequence that 
includes amino acids 58 to 419 of SEQ ID NO: 4, and that lacks amino acids 
10 22 to 57 of SEQ ID NO: 4;- 

a purified polypeptide that comprises an amino acid sequence that 
includes amino acids 46 to 419 of SEQ ID NO: 4, and that lacks amino acids 
22 to 45 of SEQ ID NO: 4; 

a purified polypeptide that comprises an amino acid sequence that 
1 5 includes amino acids 22 to 454 of SEQ ID NO: 4. 

a purified polypeptide that comprises a fragment of Asp2(b) having the 
amino acid sequence set forth in SEQ ID NO: 6, and wherein said polypeptide 
lacks transmembrane domain amino acids 430 to 452 of SEQ ID NO: 6; 

a purified polypeptide as described in the preceding paragraph that 
20 further lacks cytoplasmic domain amino acids 453 to 476 of SEQ ID NO: 6; 

a purified polypeptide as described in either of the preceding two 
paragraphs that further lacks amino acids 395-429 of SEQ ID NO: 4, which 
constitute a putative alpha helical region between the catalytic domain and the 
transmembrane domain that is believed to be unnecessary for P-secretase 
25 activity; 

a purified polypeptide comprising an amino acid sequence that 
includes amino acids 58 to 394 of SEQ ID NO: 4, and that lacks amino acids 
22to57ofSEQIDNO: 4; 
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a purified polypeptide comprising an amino acid sequence that 
includes amino acids 46 to 394 of SEQ ID NO: 4, and that lacks amino acids 
22 to 45 ofSEQIDNO:4;and 

a purified polypeptide comprising an amino acid sequence thai 
5 includes amino acids 22 to 429 of SEQ ID NO: 4. 

Also included as part of the invention is a purified polynucleotide comprising a 
nucleotide sequence that encodes such polypeptides; a vector comprising a 
polynucleotide that encodes such polypeptides; and a host cell transformed or 
transfected with such a polynucleotide or vector. 

1 0 Hu-Asp variants may be obtained by mutation of native Hu-Asp-encoding 

nucleotide sequences, for example. A Hu-Asp variant, as referred to herein, is a 
polypeptide substantially homologous to a native Hu-Asp polypeptide but which has 
an amino acid sequence different fi^om that of native Hu-Asp because of one or more 
deletions, insertions, or substitutions in the amino acid sequence. The variant amino 

1 5 acid or nucleotide sequence is preferably at least about 80% identical, more 
preferably at least about 90% identical, and most preferably at least about 95% 
identical, to a native Hu-Asp sequence. Thus, a variant nucleotide sequence which 
contains, for example, 5 point mutations for every one hundred nucleotides, as 
compared to a native Hu-Asp gene, will be 95% identical to the native protein. The 

20 percentage of sequence identity, also termed homology, between a native and a variant 
Hu-Asp sequence may also be determined, for example, by comparing the two 
sequences using any of the computer programs commonly employed for this purpose, 
such as the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, Madison Wisconsin), which 

25 uses the algorithm of Smith and Waterman {Adv. Appl Math. 2: 482-489 (1981)). 

Alterations of the native amino acid sequence may be accomplished by any of 
a number of known techniques. For example, mutations may be introduced at 
particular locations by procedures well known to the skilled artisan, such as 
oligonucleotide-directed mutagenesis, which is described by Walder et ai {Gene 

30 42: 133(1 986)); Bauer et al {Gene 37:73 { 1 985)); Craik {BioTechniques, January 
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1985, pp. 12-19); Smith et ai {Genetic Engineering: Principles and Methods, 
Plenum Press (1981)); and U.S. Patent Nos. 4,518,584 and 4,737,462. 

Hu-Asp variants within the scope of the invention may comprise 
conservatively substituted sequences, meaning that one or more amino acid residues 
5 of a Hu-Asp polypeptide are replaced by different residues that do not alter the 

secondary and/or tertiary structure of the Hu-Asp polypeptide. Such substitutions may 
include the replacement of an amino acid by a residue having similar physicochemical 
properties, such as substituting one aliphatic residue (He, Val, Leu or Ala) for another, 
or substitution between basic residues Lys and Arg, acidic residues Glu and Asp, 

10 amide residues Gin and Asn, hydroxyl residues Ser and Tyr, or aromatic residues Phe 
and Tyr. Further information regarding making phenotypically silent amino acid 
exchanges may be found in Bowie et aL Science 247:1306-1310 (1990). Other 
Hu-Asp variants which might retain substantially the biological activities of Hu-Asp 
are those where amino acid substitutions have been made in areas outside functional 

1 5 regions of the protein. 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent conditions to a portion 
of the nucleic acid molecules described above, e.g., to at least about 15 nucleotides, 
preferably to at least about 20 nucleotides, more preferably to at least about 30 

20 nucleotides, and still more preferably to at least about from 30 to at least about 100 
nucleotides, of one of the previously described nucleic acid molecules. Such portions 
of nucleic acid molecules having the described lengths refer to, e.g., at least about 15 
contiguous nucleotides of the reference nucleic acid molecule. By stringent 
hybridization conditions is intended overnight incubation at about 42°C for about 2.5 

25 hours in 6 X SSC/0. 1 % SDS, followed by washing of the filters four times for 1 5 
minutes in 1.0 X SSC at 65^C, 0, 1 % SDS. 

Fragments of the Hu-Asp encoding nucleic acid molecules described herein, as 
well as polynucleotides capable of hybridizing to such nucleic acid molecules may be 
used as a probe or as primers in a polymerase chain reaction (PGR). Such probes may 

30 be used, e.g,, to detect the presence of Hu-Asp nucleic acids in in vitro assays, as well 
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as in Southem and northern blots. Cell types expressing Hu-Asp may also be 
identified by the use of such probes. Such procedures are well known, and the skilled 
artisan will be able to choose a probe of a length suitable to the particular application. 
For PCR, 5' and 3' primers corresponding to the termini of a desired Hu-Asp nucleic 
5 acid molecule are employed to isolate and amplify that sequence using conventional 
techniques. 

Other useful fragments of the Hu-Asp nucleic acid molecules are antisense or 
sense oligonucleotides comprising a single stranded nucleic acid sequence capable of 
binding to a target Hu-Asp mRNA (using a sense strand), or Hu-Asp DNA (using an 

1 0 antisense strand) sequence. In a preferred embodiment of the invention these Hu-Asp 
antisense oligonucleotides reduce Hu-Asp mRNA and consequent production of 
Hu-Asp polypeptides. 

In another aspect, the invention includes Hu-Asp polypeptides with or without 
associated native pattern glycosylation. Both Hu-Asp 1 and Hu-Asp2 have canonical 

1 5 acceptor sites for Asn-linked sugars, with Hu-Asp 1 having two of such sites, and 

Hu-Asp2 having four. Hu-Asp expressed in yeast or mammalian expression systems 
(discussed below) may be similar to or significantly different from a native Hu-Asp 
polypeptide in molecular weight and glycosylation pattern. Expression of Hu-Asp in 
bacterial expression systems will provide non-glycosylated Hu-Asp. 

20 The polypeptides of the present invention are preferably provided in an 

isolated form, and preferably are substantially purified. Hu-Asp polypeptides may be 
recovered and purified fi^om tissues, cultured cells, or recombinant cell cultures by 
well-known methods, including ammonium sulfate or ethanol precipitation, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 

25 interaction chromatography, affinity chromatography, hydroxylapatite 

chromatography, lectin chromatography, and high performance liquid chromatography 
(HPLC). In a preferred embodiment, an amino acid tag is added to the Hu-Asp 
polypeptide using genetic engineering techniques that are well known to practitioners 
of the art which include addition of six histidine amino acid residues to allow 

30 purification by binding to nickel immobilized on a suitable support, epitopes for 
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polyclonal or monoclonal antibodies including but not limited to the T7 epitope, the 
myc epitope, and the V5a epitope, and fusion of Hu-A$p2 to suitable protein partners 
including but not limited to glutathione-S-transferase or maltose binding protein. In a 
preferred embodiment these additional amino acid sequences are added to the 
5 C-terminus of Hu-Asp but may be added to the N-terminus or at intervening positions 
within the Hu-Asp2 polypeptide. 

The present invention also relates to vectors comprising the polynucleotide 
molecules of the invention, as well as host cell transformed with such vectors. Any of 
the polynucleotide molecules of the invention may be joined to a vector, which 

10 generally includes a selectable marker and an origin of replication, for propagation in 
a host. Because the invention also provides Hu-Asp polypeptides expressed from the 
polynucleotide molecules described above, vectors for the expression of Hu-Asp are 
preferred. The vectors include DNA encoding any of the Hu-Asp polypeptides 
described above or below, operably linked to suitable transcriptional or translational 

15 regulatory sequences, such as those derived from a mammalian, microbial, viral, or 
insect gene. Examples of regulatory sequences include transcriptional promoters, 
operators, or enhancers, mRNA ribosomal binding sites, and appropriate sequences 
which control transcription and translation. Nucleotide sequences are operably linked 
when the regulatory sequence functionally relates to the DNA encoding Hu-Asp. 

20 Thus, a promoter nucleotide sequence is operably linked to a Hu-Asp DNA sequence 
if the promoter nucleotide sequence directs the transcription of the Hu-Asp sequence. 

Selection of suitable vectors to be used for the cloning of polynucleotide 
molecules encoding Hu-Asp, or for the expression of Hu-Asp polypeptides, will of 
course depend upon the host cell in which the vector will be transformed, and, where 

25 applicable, the host cell from which the Hu-Asp polypeptide is to be expressed. 

Suitable host cells for expression of Hu-Asp polypeptides include prokaryotes, yeast, 
and higher eukaryotic cells, each of which is discussed below. 

The Hu-Asp polypeptides to be expressed in such host cells may also be fusion 
proteins which include regions from heterologous proteins. Such regions may be 

30 included to allow, e.g., secretion, improved stability, or facilitated purification of the 
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polypeptide. For example, a sequence encoding an appropriate signal peptide can be 
incorporated into expression vectors. A DNA sequence for a signal peptide (secretory 
leader) may be fused inframe to the Hu-Asp sequence so that Hu-Asp is translated as a 
fusion protein comprising the signal peptide. A signal peptide that is functional in the 
5 intended host cell promotes extracellular secretion of the Hu-Asp polypeptide. 

Preferably, the signal sequence will be cleaved from the Hu-Asp polypeptide upon 
secretion of Hu-Asp from the cell. Nonlimiting examples of signal sequences that can 
be used in practicing the invention include the yeast Ifactor and the honeybee melatin 
leader in sf9 insect cells. 

10 In a preferred embodiment, the Hu-Asp polypeptide will be a fusion protein 

which includes a heterologous region used to facilitate purification of the polypeptide. 
Many of the available peptides used for such a function allow selective binding of the 
fusion protein to a binding partner. For example, the Hu-Asp polypeptide may be 
modified to comprise a peptide to form a fiision protein which specifically binds to a 

1 5 binding partner, or peptide tag. Nonlimiting examples of such peptide tags include the 
6-His tag, thioredoxin tag, hemaglutinin tag, GST tag, and OmpA signal sequence tag. 
As will be understood by one of skill in the art, the binding partner which recognizes 
and binds to the peptide may be any molecule or compound including metal ions (e.g., 
metal affinity columns), antibodies, or fragments thereof, and any protein or peptide 

20 which binds the peptide, such as the FLAG tag. 

Suitable host cells for expression of Hu-Asp polypeptides includes 
prokaryotes, yeast, and higher eukaryotic cells. Suitable prokaryotic hosts to be used 
for the expression of Hu-Asp include bacteria of the genera Escherichia, Bacillus, and 
Salmonella, as well as members of the genera Pseudomonas, Streptomyces, and 

25 Staphylococcus. For expression in, e.g., £. coli, a Hu-Asp polypeptide may include 
an N-tenninal methionine residue to facilitate expression of the recombinant 
polypeptide in a prokaiyotic host. The N-terminal Met may optionally then be 
cleaved from the expressed Hu-Asp polypeptide. Other N-terminal amino acid 
residues can be added to the Hu-Asp polypeptide to facilitate expression in 

30 Escherichia coli including but not limited lo the T7 leader sequence, the T7-caspase 8 
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leader sequence, as well as others leaders including tags for purification such as the 
6-His tag (Example 9). Hu>Asp polypeptides expressed in E. coli may be shortened 
by removal of the cytoplasmic tail, the transmembrane domain, or the membrane 
proximal region. Hu-Asp polypeptides expressed in E. coli may be obtained in either 
5 a soluble form or as an insoluble form which may or may not be present as an 

inclusion body. The insoluble polypeptide may be rendered soluble by guanidine 
HCl, urea or other protein denaturants, then refolded into a soluble form before or 
after purification by dilution or dialysis into a suitable aqueous buffer. If the inactive 
proform of the Hu-Asp was produced using recombinant methods, it may be rendered 

10 active by cleaving off the prosegment with a second suitable protease such as human 
immunodeficiency virus protease. 

Expression vectors for use in prokaryotic hosts generally comprises one or 
more phenotypic selectable marker genes. Such genes generally encode, e.g., a 
protein that confers antibiotic resistance or that supplies an auxotrophic requirement. 

15 A wide variety of such vectors are readily available firom commercial sources. 

Examples include pSPORT vectors, pGEM vectors (Promega), pPROEX vectors 
(LTI, Bethesda, MD), Bluescript vectors (Stratagene), pET vectors (Novagen) and 
pQE vectors (Qiagen). 

Hu-Asp may also be expressed in yeast host cells from genera including 

20 Saccharomyces, Pichia, and Kluveromyces. Preferred yeast hosts are S, cerevisiae 
and P. pastoris. Yeast vectors will often contain an origin of replication sequence 
from a 2T yeast plasmid, an autonomously replicating sequence (ARS), a promoter 
region, sequences for polyadenylation, sequences for transcription termination, and a 
selectable marker gene. Vectors replicable in both yeast and E. coli (termed shuttle 

25 vectors) may also be used. In addition to the above-mentioned features of yeast 

vectors, a shuttle vector will also include sequences for replication and selection in E. 
coli. Direct secretion of Hu-Asp polypeptides expressed in yeast hosts may be 
accomplished by the inclusion of nucleotide sequence encoding the yeast I- factor 
leader sequence at the 5* end of the Hu-Asp-encoding nucleotide sequence. 
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Insect host cell culture systems may also be used for the expression of Hu-Asp 
polypeptides. In a preferred embodiment, the Hu-Asp polypeptides of the invention 
are expressed using an insect cell expression system (see Example 10). Additionally, a 
baculovirus expression system can be used for expression in insect cells as reviewed 
5 by Luckow and Summers, Bio/Technology 6:47 ( 1 988). 

In another preferred embodiment, the Hu-Asp polypeptide is expressed in 
mammahan host cells. Nonlimiting examples of suitable mammalian cell lines 
include the COS? line of monkey kidney cells (Gluzman ei aL, Cell 23:175 (1981)), 
human embyonic kidney cell line 293, and Chinese hamster ovary (CHO) cells. 

10 Preferably, Chinese hamster ovary (CHO) cells are used for expression of Hu-Asp 
proteins (Example 11). 

The choice of a suitable expression vector for expression of the Hu-Asp 
polypeptides of the invention will of course depend upon the specific mammalian host 
cell to be used, and is within the skill of the ordinary artisan. Examples of suitable 

15 expression vectors include pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). A 
prefened vector for expression of Hu-Asp polypeptides is pcDNA3.1-Hygro 
(Invitrogen). Expression vectors for use in mammalian host cells may include 
transcriptional and translational control sequences derived from viral genomes. 
Commonly used promoter sequences and enhancer sequences which may be used in 

20 the present invention include, but are not limited to, those derived from human 

cytomegalovirus (CMV), Adenovirus 2, Polyoma virus, and Simian virus 40 (SV40). 
Methods for the construction of mammalian expression vectors are disclosed, for 
example, in Okayama and Berg (Mol. Cell. Biol, i:280 (1983)); Cosman et ai (Mol. 
Immunol, 25:935 (1986)); Cosman et ai (Nature 312:16% (1984)); EP-A-0367566; 

25 and WO 91/18982. 

The polypeptides of the present invention may also be used to raise polyclonal 
and monoclonal antibodies, which are useful in diagnostic assays for detecting 
Hu-Asp polypeptide expression. Such antibodies may be prepared by conventional 
techniques. See, for example, Antibodies: A Laboratory Manual^ Harlow and Land 

30 (eds.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., ( 1 988); 
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Monoclonal Antibodies, Hybridomas: A New Dimension in Biological Analyses, 
Kennet ei aL (eds.), Plenum Press, New York (1980). Synthetic peptides comprising 
portions of Hu-Asp containing 5 to 20 amino acids may also be used for the 
production of polyclonal or monoclonal antibodies afler linkage to a suitable carrier 
5 protein including but not limited to keyhole limpet hemacyanin (KLH), chicken 

ovalbumin, or bovine serum albumin using various cross-linking reagents including 
carbodimides, glutaraldehyde, or if the peptide contains a cysteine, 
N-methylmaleimide. A preferred peptide for immunization when conjugated to KLH 
contains the C-terminus of Hu-Asp 1 or Hu-Asp2 comprising 
10 QRRPRDPEVVNDESSLVRHRWK (SEQ ID NO: 2, residues 497-518) or 

LRQQHDDFADDISLLK (SEQ ID N0:4, residues 486-501), respectively. See SEQ 
ID Nos. 33-34. 

The Hu-Asp nucleic acid molecules of the present invention are also valuable 
for chromosome identification, as they can hybridize with a specific location on a 

1 5 human chromosome. Hu-Aspl has been localized to chromosome 2 1 , while 

Hu-Asp2 has been localized to chromosome 1 lq23.3-24.1 . There is a current need for 
identifying particular sites on the chromosome, as few chromosome marking reagents 
based on actual sequence data (repeat polymorphisms) are presently available for 
marking chromosomal location. Once a sequence has been mapped to a precise 

20 chromosomal location, the physical position of the sequence on the chromosome can 
be correlated with genetic map data. The relationship between genes and diseases that 
have been mapped to the same chromosomal region can then be identified through 
linkage analysis, wherein the coinheritance of physically adjacent genes is determined. 
Whether a gene appearing to be related to a particular disease is in fact the cause of 

25 the disease can then be determined by comparing the nucleic acid sequence between 
affected and unaffected individuals. 

In another embodiment, the invention relates to a method of assaying Hu-Asp 
function, specifically Hu-Asp2 function which involves incubating in solution the 
Hu-Asp polypeptide with a suitable substrate including but not limited to a synthetic 

30 peptide containing the P-secretase cleavage site of APP, preferably one containing the 
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mutation found in a Swedish kindred with inherited AD in which KM is changed to 
NL, such peptide comprising the sequence SEVNLDAEFR (SEQ E) NO: 63) in an 
acidic buffering solution, preferably an acidic buffering solution of pH5.5 (see 
Example 12) using cleavage of the peptide monitored by high performance liquid 
5 chromatography as a measure of Hu-Asp proteolytic activity. Preferred assays for 

proteolytic activity utilize internally quenched peptide assay substrates. Such suitable 
substrates include peptides which have attached a paired flurophore and quencher 
including but not limited to 7-amino-4-methyl coumarin and dinitrophenol, 
respectively, such that cleavage of the peptide by the Hu-Asp results in increased 

10 fluorescence due to physical separation of the flurophore and quencher. Other paired 
flurophores and quenchers include bodipy-tetramethylrhodamine and QSY-5 
(Molecular Probes, Inc.). In a variant of this assay, biotin or another suitable tag may 
be placed on one end of the peptide to anchor the peptide to a substrate assay plate and 
a fluropihore may be placed at the other end of the peptide. Useful flurophores include 

IS those listed above as well as Europium labels such as W8044 (EG&g Wallac, Inc.). 
Cleavage of the peptide by Asp2 will release the flurophore or other tag from the 
plate, allowing compounds to be assayed for inhibition of Asp2 proteolytic cleavage 
as shown by an increase in retained fluorescence. Preferred colorimetric assays of 
Hu-Asp proteolytic activity utilize other suitable substrates that include the P2 and PI 

20 amino acids comprising the recognition site for cleavage linked to o-nitrophenol 

through an amide linkage, such that cleavage by the Hu-Asp results in an increase in 
optical density after altering the assay buffer to alkaline pH. 

In another embodiment, the invention relates to a method for the identification 
of an agent that increases the activity of a Hu-Asp polypeptide selected from the group 

25 consisting of Hu-Asp 1 , Hu-Asp2(a), and Hu-Asp2(b), the method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of 
a test agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide determined in 

30 the absence of said test agent; 
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whereby a higher level of activity in the presence of said test agent than in the absence 
of said test agent indicates that said test agent has increased the activity of said 
Hu-Asp polypeptide. Such tests can be performed with Hu-Asp polypeptide in a cell 
free system and with cultured cells that express Hu-Asp as well as variants or 
5 isoforms thereof 

In another embodiment, the invention relates to a method for the identification 
of an agent that decreases the activity of a Hu-Asp polypeptide selected from the 
group consisting of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of 
10 a test agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide determined in 
the absence of said test agent; 

whereby a lower level of activity in the presence of said test agent than in the absence 
IS of said test agent indicates that said test agent has decreased the activity of said 

Hu-Asp polypeptide. Such tests can be performed with Hu-Asp polypeptide in a cell 
free system and with cultured cells that express Hu-Asp as well as variants or 
isoforms thereof. 

In another embodiment, the invention relates to a novel cell line (HEK 125.3 
20 cells) for measuring processing of amyloid P peptide (AP) from the amyloid protein 
precursor (APP). The cells are stable transformants of human embryonic kidney 293 
cells (HEK293) with a bicistronic vector derived from pIRES-EGFP (Clontech) 
containing a modified human APP cDNA, an internal ribosome entry site and an 
enhanced green fluorescent protein (EGFP) cDNA in the second cistron. The APP 
25 cDNA was modified by adding two lysine codons to the carboxyl terminus of the 

APP coding sequence. This increases processing of Ap peptide from human APP by 
2-4 fold. This level of Ap peptide processing is 60 fold higher than is seen in 
nontransfomied HEK293 cells. HEK 125.3 cells will be useful for assays of 
compounds that inhibit AP peptide processing. This invention also includes addition 
30 of two lysine residues to the C-terminus of other APP isoforms including the 75 1 and 
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770 amino acid isoforms, to isoforms of APP having mutations found in human AD 
including the Swedish KJVI-NL and V717^F mutations, to C-terminal fragments of 
APP, such as those beginning with the P-secretasc cleavage site, to C-terminal 
fragments of APP containing the p-secretase cleavage site which have been operably 
5 linked to an N-terminal signal peptide for membrane insertion and secretion, and to 
C-terminal fragments of APP which have been operably linked to an N-terminal 
signal peptide for membrane insertion and secretion and a reporter sequence including 
but not limited to green fluorescent protein or alkaline phosphatase, such that 
jj-secretase cleavage releases the reporter protein from the surface of cells expressing 
10 the polypeptide. 

Having generally described the invention, the same will be more 
readily understood by reference to the following examples, which are provided by way 
of illustration and are not intended as limiting. 

IS Example! 

Development of a Search Algorithm Useful for the 
Identification of Aspartyl Proteases, and Identification 
of C. elegans Aspartyl Protease Genes in Wormpep 12 

Materials and Methods: 

20 Classical aspartyl proteases such as pepsin and renin possess a two-domain structure 
which folds to bring two aspartyl residues into proximity within the active site. These 
are embedded in the short tripeptide motif DTG, or more rarely, DSG. The DTG or 
DSG active site motif appears at about residue 25-30 in the enzyme, but at about 
65-70 in the proenzyme (prorenin, pepsinogen). This motif appears again about 

25 1 50-200 residues downstream. The proenzyme is activated by cleavage of the 

N-tenminal prodomain. This pattern exemplifies the double domain structure of the 
modem day aspartyl enzymes which apparently arose by gene duplication and 
divergence. Thus; 

NH^ X D"TG Y D^*"TG~'- C 

30 where X denotes the beginning of the enzyme, following the N-terminal prodomain, 
and Y denotes the center of the molecule where the gene repeat begins again. 
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In the case of the retroviral enzymes such as the HIV protease, they represent 
only a half of the two-domain structures of well-known enzymes like pepsin, 
cathepsin D, renin, etc. They have no prosegment, but are carved out of a polyprotein 
precursor containing the gag and pol proteins of the virus. They can be represented 
5 by: 

NH2 D'^TG - CI 00 

This "monomer" only has about 1 00 aa, so is extremely parsimonious as compared to 
the other aspartyl protease "dimers" which have of the order of 330 or so aa, not 
counting the N-terminal prodomain. 

10 The limited length of the eukaryotic aspartyl protease active site motif makes 

it diflicult to search EST collections for novel sequences. EST sequences typically 
average 250 nucleotides, and so in this case would be unlikely to span both aspartyl 
protease active site motifs. Instead; we turned to the C. elegans genome. The C 
elegans genome is estimated to contain around 13,000 genes. Of these, roughly 

1 5 1 2,000 have been sequenced and the corresponding hypothetical open reading frame 
(ORF) has been placed in the database Wormpepl2. We used this database as the 
basis for a whole genome scan of a higher eukaryote for novel aspartyl proteases, 
using an algorithm that we developed specifically for this purpose. The following 
AWK script for locating proteins containing two DTG or DSG motifs was used for 

20 the search, which was repeated four times to recover all pairwise combinations of the 
aspartyl motif 

BEGIN {RS=''>"} /* defines as record separator for FASTA format */ 

{ 

pos = index($0,"DTG'') /*finds "DTG" in record*/ 
25 if (pos>0) { 

rest = substr($0,pos+3) /*get rest of record after first DTG*/ 

pos2 = index(rest,'T)TG'') /*find second DTG*/ 

if (pos2>0) printf ("%s%s\n'V'>",$0)} /*report hits*/ 

} 

30 } 

The AWK script shown above was used to search WorTnpepl2, which was 
downloaded from ftp.sanger.ac.uk/pub/databases/wormpep, for sequence entries 
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conlaining at least two DTG or DSG motifs. Using AWK limited each record to 3000 
characters or less. Thus, 35 or so larger records were eliminated manually from 
Wormpepl2 as in any case these were unlikely to encode aspartyl proteases. 
Results and Discussion: 
5 The Wormpep 12 database contains 12,1 78 entries, although some of these 

(<10%) represent alternatively spliced transcripts from the same gene. Estimates of 
the number of genes encoded in the C elegans genome is on the order of 13,000 
genes, so Wormpep 12 may be estimated to cover greater than 90% of the C. elegans 
genome. 

10 Eukaryotic aspartyl proteases contain a two-domain structure, probably arising 

from ancestral gene duplication. Each domain contains the active site motif D(S/T)G 
located from 20-25 amino acid residues into each domain. The retroviral (e.g., HFV 
protease) or retrotransposon proteases are homodimers of subunits which are 
homologous to a single eukaryotic aspartyl protease domain. An AWK script was 

15 used to search the Wonnpepl2 database for proteins in which the D(S/T)G motif 
occurred at least twice. This identified >60 proteins with two DTG or DSG motifs. 
Visual inspection was used to select proteins in which the position of the aspartyl 
domains was suggestive of a two-domain structure meeting the criteria described 
above. 

20 In addition, the PROSITE eukaryotic and viral aspartyl protease active site 

pattern PS00141 was used to search Wormpepl2 for candidate aspartyl proteases. 
(Bairoch A., Bucher P., Hofmann K., The PROSITE database: its status in 1997, 
Nucleic Acids Res, 24:2 1 7-22 1 ( 1 997)). This generated an overlapping set of 
Wonnpepl2 sequences. Of these, seven sequences contained two DTG or DSG 

25 motifs and the PROSITE aspartyl protease active site pattern. Of these seven, three 
were found in the same cosmid clone (F21F8.3, F21F8.4, and F21F8.7) suggesting 
that they represent a family of proteins that arose by ancestral gene duplication. Two 
other ORFs with extensive homology to F21F8.3, F21F8.4 and F21F8.7 are present in 
the same gene cluster (F21F8.2 and F21F8.6), however, these contain only a single 

30 DTG motif. Exhaustive BLAST searches with these seven sequences against 
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Wormpepl2 failed to reveal additional candidate aspartyl proteases in the C elegans 
genome containing two repeats of the DTG or DSG motif 

BLASTX search with each C elegans sequence against SWISS-PROT, 
GenPep and TREMBL revealed that R12H7.2 was the closest worm homologue to the 
5 known mammalian aspartyl proteases, and that Tl 8H9.2 was somewhat more 

distantly related, while CEASPl, F21F8.3, F21F8.4, and F21F8.7 formed a subcluster 
which had the least sequence homology to the mammalian sequences. 
Discussion: 

APP, the presenilins, and p35, the activator of cdk5, all undergo intracellular 
1 0 proteolytic processing at sites which confonn to the substrate specificity of the HIV 
protease. Dysregulation of a cellular aspartyl protease with the same substrate 
specificity, might therefore provide a unifying mechanism for causation of the plaque 
and tangle pathologies in AD. Therefore, we sought to identify novel human aspartyl 
proteases. A whole genome scan in C. elegans identified seven open reading fi-ames 
1 5 that adhere to the aspartyl protease profile that we had identified. These seven 

aspartyl proteases probably comprise the complete complement of such proteases in a 
simple, multicellular eukaryote. These include four closely related aspartyl proteases 
unique to C elegans which probably arose by duplication of an ancestral gene. The 
other three candidate aspartyl proteases (T18H9.2, R12H7.2 and CI 1D2.2) were 
20 found to have homology to manmialian gene sequences. 

Example 2 



25 



Identification of Novel Human Aspartyl 
Proteases Using Database Mining by Genome Bridging 



Materials and Methods : 

Computer-assisted analysis of EST databases, cDNA , and predicted polypeptide 
sequences: 

Exhaustive homology searches of EST databases with the CEASPl, F21F8.3, 
30 F21F8.4, and F21F8.7 sequences failed to reveal any novel manmialian homologues. 
TBLASTN searches with R12H7.2 showed homology to cathepsin D, cathepsin E, 
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pepsinogen A, pepsinogen C and renin, particularly around the DTG molif within the 
active site, but also failed to identify any additional novel mammalian aspartyl 
proteases. This indicates that the C elegam genome probably contains only a single 
lysosomal aspartyl protease which in mammals is represented by a gene family that 
5 arose through duplication and consequent modification of an ancestral gene. 

TBLASTN searches with Tl 8H9.2, the remaining C. elegans sequence, 
identified several ESTs which assembled into a contig encoding a novel human 
aspartyl protease (Hu-ASPl ). As is described above in Example K BLASTX search 
with the Hu-ASPl contig against SWISS-PROT revealed that the active site motifs in 

10 the sequence aligned with the active sites of other aspartyl proteases. Exhaustive, 
repetitive rounds of BLASTN searches against LifeSeq, LifeSeqFL, and the public 
EST collections identified 102 EST firom multiple cDNA libraries that assembled into 
a single contig. The 51 sequences in this contig found in public EST collections also 
have been assembled into a single contig (THC2 13329) by The Institute for Genome 

1 5 Research (TIGR). The TIGR annotation indicates that they failed to find any hits in 
the database for the contig. Note that the TIGR contig is the reverse complement of 
the LifeSeq contig that we assembled. BLASTN search of Hu-ASPl against the rat 
and mouse EST sequences in ZooSeq revealed one homologous EST in each database 
(Incyte clone 70031 1523 and IMAGE clone 313341, GenBank accession number 

20 W10530, respectively). 

TBLASTN searches with the assembled DNA sequence for Hu-ASPl against 
both LifeSeqFL and the public EST databases identified a second, related human 
sequence (Hu-Asp2) represented by a single EST (2696295). Translation of this 
partial cDNA sequence reveals a single DTG motif which has homology to the active 

25 site motif of a bovine aspartyl protease, NM 1 . 

BLAST searches, contig assemblies and muhiple sequence alignments were 
performed using the bioinformatics tools provided with the LifeSeq, LifeSeqFL and 
LifeSeq Assembled databases from Incyte. Predicted protein motifs were identified 
using either the ProSite dictionary (Motifs in GCG 9) or the Pfam database. 

30 
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Full-length cDNA cloning of Hu-Aspl 

The open reading frame of C elegans gene T18H9.2CE was used to query 
Incyte LifeSeq and LifeSeq-FL databases and a single electronic assembly referred to 
as 1863920CE1 was detected. The 5' most cDNA clone in this contig, 1863920, was 
5 obtained from Incyte and completely sequenced on both strands. Translation of the 
open reading frame contained within clone 1 863920 revealed the presence of the 
duplicated aspartyl protease active site motif (DTG/DSG) but the 5' end was 
incomplete. The remainder of the Hu-Aspl coding sequence was determined by 5' 
Marathon RACE analysis using a human placenta Marathon ready cDNA template 

10 (Clontech). A 3*-antisense oligonucleotide primer specific for the 5' end of clone 
1863920 was paired with the 5 '-sense primer specific for the Marathon ready cDNA 
synthetic adaptor in the PCR. Specific PCR products were directly sequenced by 
cycle sequencing and the resulting sequence assembled with the sequence of clone 
1863920 to yield the complete coding sequence of Hu-Asp-1 (SEQ ID No. 1). 

IS Several interesting features are present in the primary amino acid sequence of 

Hu-Aspl (Figure 1, SEQ ID No. 2). The sequence contains a signal peptide (residues 
1-20 in SEQ ID No. 2), a pro-segment, and a catalytic domain containing two copies 
of the aspartyl protease active site motif (DTG/DSG). The spacing between the first 
and second active site motifs is about 200 residues which should correspond to the 

20 expected size of a single, eukaryotic aspartyl protease domain. More interestingly, the 
sequence contains a predicted transmembrane domain (residues 469-492 in SEQ ID 
No.2) near its C-terminus which suggests that the protease is anchored in the 
membrane. This feature is not found in any other aspartyl protease. 

25 Cloning of a full-length Hu-Asp-l cDNAs: 

As is described above in Example 1, genome wide scan of the Caenorhahditis 
elegans database WormPepl2 for putative aspartyl proteases and subsequent mining 
of human EST databases revealed a human ortholog to the C. elegans gene Tl 8H9.2 
referred to as Hu-Aspl . The assembled contig for Hu-Aspl was used to query for 

30 human paralogs using the BLAST search tool in human EST databases and a single 
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significant match (2696295CE1) with approximately 60% shared identity was found 
in the LifeSeq FL database. Similar queries of either gblOSPubEST or the family of 
human databases available from TIGR did not identify similar EST clones. cDNA 
clone 2696295, identified by single pass sequence analysis from a human uterus 

5 cDNA library, was obtained from Incyte and completely sequence on both strands. 
This clone contained an incomplete 1266 bp open-reading frame that encoded a 422 
amino acid polypeptide but lacked an initiator ATG on the 5' end. Inspection of the 
predicted sequence revealed the presence of the duplicated aspartyl protease active 
site motif DTG/DSG, separated by 194 amino acid residues. Subsequent queries of 

10 later releases of the LifeSeq EST database identified an additional ESTs, sequenced 
from a human astrocyte cDNA library (4386993), that appeared to contain additional 
5* sequence relative to clone 2696295. Clone 4386993 was obtained from Incyte and 
completely sequenced on both strands. Comparative analysis of clone 4386993 and 
clone 2696295 confirmed that clone 4386993 extended the open-reading frame by 31 

15 amino acid residues including two in-fi^e translation initiation codons. Despite the 
presence of the two in-frame ATGs, no in-frame stop codon was observed upstream of 
the ATG indicating that the 4386993 may not be full-length. Furthermore, alignment 
of the sequences of clones 2696295 and 4386993 revealed a 75 base pair insertion in 
clone 2696295 relative to clone 4386993 that results in the insertion of 25 additional 

20 amino acid residues in 2696295. The remainder of the Hu-Asp2 coding sequence was 
determined by 5' Marathon RACE analysis using a human hippocampus Marathon 
ready cDNA template (Clontech). A 3'-antisense oligonucleotide primer specific for 
the shared 5' -region of clones 2696295 and 4386993 was paired with the 5 '-sense 
primer specific for the Marathon ready cDNA synthetic adaptor in the PGR. Specific 

25 PCR products were directly sequenced by cycle sequencing and the resulting sequence 
assembled with the sequence of clones 2696295 and 4386993 to yield the complete 
coding sequence of Hu-Asp2(a) (SEQ ID No. 3) and Hu-Asp2(b) (SEQ ID No. 5), 
respectively. 

Several interesting features are present in the primary amino acid sequence of 
30 Hu.Asp2(a) (Figure 2 and SEQ ID No. 4) and Hu-Asp-2(b) (Figure 3, SEQ ID No, 6). 
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Both sequences contain a signal peptide (residues 1-21 in SEQ ID No. 4 and SEQ ID 
No. 6), a pro-segment, and a catalytic domain containing two copies of the aspartyl 
protease active site motif (DTG/DSG). The spacing between the first and second 
active site motifs is variable due to the 25 amino acid residue deletion in Hu-Asp-2(b) 
5 and consists of 168-ver5i/5-194 amino acid residues, for Hu-Asp2(b) and Hu-Asp-2(a), 
respectively. More interestingly, both sequences contains a predicted transmembrane 
domain (residues 455-477 in SEQ ID No.4 and 430-452 in SEQ ID No. 6) near their 
C-termini which indicates that the protease is anchored in the membrane. This feature 
is not found in any other aspartyl protease except Hu-Aspl . 

10 

Example 3 

Molecular cloning of mouse Asp2 cDNA and genomic DNA. 

Cloning and characterization of murine Asp2 cDNA, 

The murine ortholog of Hu-Asp2 was cloned using a combination of cDNA 

1 5 library screening, PCR, and genomic cloning. Approximately 500,000 independent 
clones from a mouse brain cDNA library were screened using a ^^P-labeled coding 
sequence probe prepared from Hu-Asp2. Replicate positives were subjected to DNA 
sequence analysis and the longest cDNA contained the entire 3* untranslated region 
and 47 amino acids in the coding region. PCR amplification of the same mouse brain 

20 cDNA library with an antisense oligonucleotide primer specific for the 5*-most cDNA 
sequence determined above and a sense primer specific for the 5' region of human 
Asp2 sequence followed by DNA sequence analysis gave an additional 980 bp of the 
coding sequence. The remainder of the 5' sequence of murine Asp-2 was derived from 
genomic sequence (see below). 

25 

Isolation and sequence analysis of the murine Asp-2 gene. 

A murine EST sequence encoding a portion of the murine Asp2 cDNA was 
identified in the GenBank EST database using the BLAST search tool and the 
Hu-Asp2 coding sequence as the query. Clone g3 160898 displayed 88% shared 
30 identity to the human sequence over 352 bp. Oligonucleotide primer pairs specific for 
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this region of murine Asp2 were then synthesized and used to amplify regions of the 
murine gene. Murine genomic DNA, derived from strain 129/SvJ, was amplified in 
the PCR (25 cycles) using various primer sets specific for murine Asp2 and the 
products analyzed by agarose gel electrophoresis. The primer set Zoo-1 and Zoo-4 
5 amplified a 750 bp fragment that contained approximately 600 bp of intron sequence 
based on comparison to the known cDNA sequence. This primer set was then used to 
screen a murine BAC library by PCR, a single genomic clone was isolated and this 
cloned was confirmed contain the murine Asp2 gene by DNA sequence analysis. 
Shotgun DNA sequencing of this Asp2 genomic clone and comparison to the cDNA 

1 0 sequences of both Hu-Asp2 and the partial murine cDNA sequences defined the 
full-length sequence of murine Asp2 (SEQ ID No. 7). The predicted amino acid 
sequence of murine Asp2 (SEQ ID No. 8) showed 96.4% shared identity (GCG 
BestFit algorithm) with 18/501 amino acid residue substitutions compared to the 
human sequence (Figure 4). The proteolytic processing of murine Asp2(a) is believed 

15 to be analogous to the processing described above for human Asp2(a). In addition, a 
variant lacking amino acid residues 190-214 of SEQ ID NO: 8 is specifically 
contemplated as a murine Asp2(b) polypeptide. All forms of murine Asp2(b) gene 
and protein are intended as aspects of the invention. 

20 Example 4 

Tissue Distribution of Expression of Hu-Asp2 Transcripts 

Materials and Methods: 

The tissue distribution of expression of Hu-Asp-2 was determined using 
multiple tissue Northern blots obtained from Clontech (Palo Alto, CA). Incyte clone 

25 2696295 in the vector pINCY was digested to completion with EcoRVNotl and the 1 .8 
kb cDNA insert purified by preparative agarose gel electrophoresis. This fragment 
was radiolabeled to a specific activity > 1 X 10' dpm/pg by random priming in the 
presence of [a-^^P-dATP] (>3000 Ci/mmol, Amersham, Arlington Heights, IL) and 
Klenow fragment of DNA polymerase 1. Nylon filters containing denatured, size 

30 fractionated poly A* RNAs isolated from different human tissues were hybridized 
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with 2x10* dpm/ml probe in ExpressHyb buffer (Clontech, Palo Alto, CA) for 1 hour 
at 68 X and washed as recommended by the manufacture. Hybridization signals were 
visualized by autoradiography using' BioMax XR film (Kodak, Rochester, NY) with 
intensifying screens at -80 ''C. 

5 

Results and Discussion: 

Limited information on the tissue distribution of expression of Hu-Asp-2 
transcripts was obtained from database analysis due to the relatively small number of 
ESTs detected using the methods described above (< 5). In an effort to gain further 

1 0 information on the expression of the Hu-Asp2 gene. Northern analysis was employed 
to determine both the size(s) and abundance of Hu-Asp2 transcripts. PolyA* RNAs 
isolated from a series of peripheral tissues and brain regions were displayed on a solid 
support following separation under denaturing conditions and Hu-Asp2 transcripts 
were visualized by high stringency hybridization to radiolabeled insert from clone 

1 5 2696295. The 2696295 cDNA probe visualized a constellation of transcripts that 

migrated with apparent sizes of 3.0kb, 4.4 kb and 8.0 kb with the latter Uvo transcript 
being the most abundant. 

Across the tissues surveyed, Hu-Asp2 transcripts were most abundant in 
pancreas and brain with lower but detectable levels observed in all other tissues 

20 examined except thymus and PBLs. Given the relative abundance of Hu-Asp2 

transcripts in brain, the regional expression in brain regions was also established. A 
similar constellation of transcript sizes were detected in all brain regions examined 
[cerebellum, cerebral cortex, occipital pole, frontal lobe, temporal lobe and putamen] 
with the highest abundance in the medulla and spinal cord. 

25 

Example 5 

Northern Blot Detection of HuAsp-1 and 
HuAsp-2 Transcripts in Human Cell Lines 

A variety of human cell lines were tested for their ability to produce Hu-Aspl 

30 and Asp2 mRNA. Human embryonic kidney (HEK-293) cells, African green monkey 

(Cos-7) cells, Chinese hamster ovary (CHO) cells, HELA cells, and the 
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neuroblastoma cell line lMR-32 were all obtained from the ATCC. Cells were 
cultured in DME containing 10% FCS except CHO cells which were maintained in 
a-MEM/10% FCS at 37 °C in 5% COp until they were near confluence. Washed 
monolayers of cells (3 X 10') were lysed on the dishes and poly RNA extracted 
5 using the Qiagen Oligotex Direct mRNA kit. Samples containing 2 ^ig of poly A* 
RNA from each cell line were fractionated under denaturing conditions 
(glyoxal-treated), transferred to a solid nylon membrane support by capillary action, 
and transcripts visualized by hybridization with random-primed labeled (^^P) coding 
sequence probes derived from either Hu-Aspl or Hu-Asp2. Radioactive signals were 
1 0 detected by exposure to X-ray film and by image analysis with a Phosphorlmager. 

The Hu-Aspl cDNA probe visualized a similar constellation of transcripts (2.6 
kb and 3.5 kb) that were previously detected is human tissues. The relative abundance 
determined by quantification of the radioactive signal was Cos-7 > HEK 292 = HELA 
> IMR32. 

1 5 The Hu-Asp2 cDNA probe also visualized a similar constellation of transcripts 

compared to tissue (3.0 kb, 4.4 kb, and 8.0 kb) with the following relative abundance; 
HEK 293 > Cos 7 > IMR32 > HELA. 

Example 6 

20 Modiflcation of AFP to increase Ap processing for in vitro screening 

Human cell lines that process Ap peptide from AFP provide a means to screen 
in cellular assays for inhibitors of P- and y-secretase. Production and release of Ap 
peptide into the culture supernatant is monitored by an enzyme-linked immunosorbent 
assay (ElA). Although expression of APP is widespread and both neural and 

25 non*neuronal cell lines process and release Ap peptide, levels of endogenous APP 

processing are low and difficult to detect by EIA. AP processing can be increased by 
expressing in transformed cell lines mutations of APP that enhance Ap processing. 
We made the serendipitous observation that addition of two lysine residues to the 
carboxyl terminus of APP695 increases AP processing still further. This allowed us 
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to create a transformed cell line that releases Ap peptide into the culture medium at 
the remarkable level of 20,000 pg/ml. 
Materials And Methods 

Materials: 

5 Human embryonic kidney cell line 293 (HEK293 cells) were obtained 

internally. The vector pIRES-EGFP was purchased from Clontech. Oligonucleotides 
for mutation using the polymerase chain reaction (PGR) were purchased from 
Genosys. A plasmid containing human APP695 (SEQ E) No. 9 [nucleotide] and SEQ 
ID No. 10 [amino acid]) was obtained from Northwestern University Medical School. 
10 This was subcloned into pSK (Stratagene) at the Not\ site creating the plasmid 
pAPP695. 

Mutagenesis protocol: 

The Swedish mutation (K670N, M671L) was introduced into pAPP695 using 
the Stratagene Quick Change Mutagenesis Kit to create the plasmid pAPP695NL 
1 5 (SEQ ID No. 1 1 [nucleotide] and SEQ ID No. 1 2 [amino acid]). To introduce a 
di-lysine motif at the C-terminus of APP695, the forward primer #276 5' 
GACTGACCACTCGACCAGGTTC (SEQ ID No. 47) was used with the "patch" 
primer #274 5' 

CGAATTAAATTCCAGCACACTGGCTACTTCTTGTTCTGCATCTCAAAGAAC 

20 (SEQ ID No. 48) and the flanking primer #275 

CGAATTAAATTCCAGCACACTGGCTA (SEQ ID No. 49) to modify the 3' end of 
the APP695 cDNA (SEQ ID No. 15 [nucleotide] and SEQ ED No. 16 [amino acid]). 
This also added a BstXl restriction site that will be compatible with the BslXl site in 
the multiple cloning site of pIRES-EGFP. PGR amplification was performed with a 

25 Clontech HF Advantage cDNA PGR kit using the polymerase mix and buffers 

supplied by the manufacturer. For "patch" PGR, the patch primer was used at l/20th 
the molar concentration of the flanking primers. PGR amplification products were 
purified using a QlAquick PGR purification kit (Qiagen). Afler digestion with 
restriction enzymes, products were separated on 0.8% agarose gels and then excised 

30 DNA fragments were purified using a QlAquick gel extraction kit (Qiagen). 
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To reassemble a modified APP695-Sw cDNA, the 5* Notl-Bgl2 fragment of 
the APP695-SW cDNA and the 3' BgI2-BstXl APP695 cDNA fragment obtained by 
PCR were hgated into pIRES-EGFP plasmid DNA opened at the Notl and BstXl 
sites. Ligations were performed for 5 minutes at room temperature using a Rapid 
5 DNA Ligation kit (Boehringer Mannheim) and transformed into Library Efficiency 
DH5a Competent Cells (GibcoBRL Life Technologies). Bacterial colonies were 
screened for inserts by PCR amplification using primers #276 and #275. Plasmid 
DNA was purified for mammalian cell transfection using a QL\prep Spin Miniprep 
kit (Qiagen). The construct obtained was designated pMG 125.3 (APPSW-KK, SEQ 

10 ID No. 1 7 [nucleotide] and SEQ ID No. 1 8 [amino acid]). 
Mammalian Cell Transfection: 

HEK293 cells for transfection were grown to 80% confluence in Dulbecco's 
modified Eagle's medium (DMEM) with 10% fetal bovine serum. Cotransfections 
were performed using Lipofect Amine (Gibco-BRL) with 3 \ig pMG 125.3 DNA and 9 

1 5 iig pcDN A3. 1 DNA per 1 0 x 1 0* cells. Three days posttransfection, cells were 

passaged into medium containing G418 at a concentration of 400 ^g/ml. Afler three 
days growth in selective medium, cells were sorted by their fluorescence. 
Clonal Selection of 125.3 cells by FACS: 

Cell samples were analyzed on an EPICS Elite ESP flow cytometer (Coulter, 

20 Hialeah, FL) equipped with a 488 nm excitation line supplied by an air-cooled argon 
laser. EGFP emission was measured through a 525 nm band-pass filter and 
fluorescence intensity was displayed on a 4-decade log scale after gating on viable 
cells as determined by forward and right angle light scatter. Single green cells were 
separated into each well of one 96 well plate containing growth medium without 

25 G41 8. After a four day recovery period, G41 8 was added to the medium to a final 
concentration of 400 (Xg/ml. After selection, 32% of the wells contained expanding 
clones. Wells with clones were expanded from the 96 well plate to a 24 well plate 
and then a 6 well plate with the fastest growing colonies chosen for expansion at each 
passage. The final cell line selected was the fastest growing of the final six passaged. 

30 This clone, designated 12S.3, has been maintained in G418 at 400 ug/ml with passage 
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every four days into fresh medium. No loss of AP production of EGFP fluorescence 
has been seen over 23 passages. 

ApEIA Analysis (Double Antibody Sandwich ELISAfor hAfi 1-40/42): 

Cell culture supematants harvested 48 hours after transfection were analyzed 
5 in a standard AP ElA as follows. Human AP 1-40 or 1-42 was measured using 

monoclonal antibody (mAb) 6E10 (Senetek, St. Louis, MO) and biotinylated rabbit 
antiserum 162 or 164 (New York State Institute for Basic Research, Staten Island, 
NY) in a double antibody sandwich ELISA. The capture antibody 6E1 0 is specific to 
an epitope present on the N-terminal amino acid residues 1-16 of hAp. The 

10 conjugated detecting antibodies 162 and 164 are specific for hAP 1-40 and 1-42, 
respectively. Briefly, a Nunc Maxisorp 96 well immunoplale was coated with 100 
|il/well of mAb 6E10 (5pg/ml) diluted in O.IM carbonate-bicarbonate buffer, pH 9.6 
and incubated at 4T overnight. After washing the plate 3x with 0.01 M DPBS 
(Modified Dulbecco's Phosphate Buffered Saline (0.008M sodium phosphate, 0.002M 

1 5 potassium phosphate, 0. 14M sodium chloride, 0.01 M potassium chloride, pH 7.4) 
from Pierce, Rockford, II) containing 0.05% of Tween-20 (DPBST), the plate was 
blocked for 60 minutes with 200 |il of 10% normal sheep serum (Sigma) in 0.01 M 
DPBS to avoid non-specific binding. Human AP 1-40 or 1-42 standards 100 pl/well 
(Bachem, Torrance, CA) diluted, from a Img/ml stock solution in DMSO, in culture 

20 medium was added after washing the plate, as well as 1 00 |il/well of sample, eg., 
conditioned medium of transfected cells. 

The plate was incubated for 2 hours at room temperature and 4®C overnight. 
The next day, after washing the plate, 100 ^l/well biotinylated rabbit antiserum 162 
1 :400 or 164 1 :50 diluted in DPBST + 0.5% BSA was added and incubated at room 

25 temperature for Ihour, 15 minutes. Following washes, 100 ^l/well 

neutravidin-horseradish peroxidase (Pierce, Rockford, II) diluted 1:10,000 in DPBST 
was applied and incubated for 1 hour at room temperature. After the last washes 100 
^ll/welI of o-phenylnediamine dihydrochloride (Sigma Chemicals, St. Louis, MO) in 
50mM citric acid/1 OOmM sodium phosphate buffer (Sigma Chemicals, St. Louis, 

30 MO), pH 5.0, was added as substrate and the color development was monitored at 
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450nm in a kinetic microplate reader for 20 minutes using Soft max Pro software. All 
standards and samples were run in triplicates. The samples with absorbance values 
falling within the standard curve were extrapolated from the standard curves using 
Soft max Pro software and expressed in pg/ml culture medium. 
5 Results: 

Addition of two lysine residues to the carboxyl terminus of APP695 greatly 
increases AP processing in HEK293 cells as shown by transient expression (Table 1). 
Addition of the di-lysine motif to APP695 increases AP processing to that seen with 
the APP695 containing the Swedish mutation. Combining the di-lysine motif with the 

10 Swedish mutation further increases processing by an additional 2.8 fold. 

Cotransformation of HEK293 cells with pMG125.3 and pcDNA3.1 allowed 
dual selection of transformed cells for G418 resistance and high level expression of 
EGFP. After clonal selection by FACS, the cell line obtained, produces a remarkable 
20,000 pg Ap peptide per ml of culture medium after growth for 36 hours in 24 well 

15 plates. Production of Ap peptide under various growth conditions is summarized in 
Table 2. 



20 



25 



30 
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TABLE I 

Release of AP peptide into the culture medium 48 hours after transient 
5 transfection of HEK293 cells with the indicated vectors containing wildtype or 

modified APP. Values tabulated are mean + SD and P-value for pairwise comparison 
using Student's t-test assuming unequal variances. 



10 



20 



25 



APP Construct 


AP 1-40 peptide 
(pg/ml) 


Fold Increase 


P-value 


pIRES-EGFP vector 


147 + 28 


1.0 




wtAPP695 (142.3) 


194+15 


1.3 


0.051 


wtAPP695-KK (124.1) 


424 + 34 


2.8 


3 X 10-5 


APP695-SW (143.3) 


457 + 65 


3.1 


2x 10-3 


APP695-SwKK (125.3) 


1308 + 98 


8.9 


3 X 10-4 
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TABLE 2 

Release of Ap peptide from HEK125.3 cells under various growth conditions. 



10 



Type of Culture 


Volume of 


Duration of 


Ap 1-40 


Ap 1-42 


Plate 


Medium 


Culture 


(pg/ml) 


(pg/ml) 


24 well plate 


400 ul 


36 hr 


28,036 


1,439 



Example 7 

Antisense oligomer inhibition of Abeta processing in HEK125J cells 

1 5 The sequences of Hu- Asp 1 and Hu- Asp2 were provided to Sequi tur, Inc 

(Natick, MA) for selection of targeted sequences and design of 2nd generation 
chimeric antisense oligomers using prorietaiy technology (Sequitur Ver. D Pat 
pending #3002). Antisense oligomers Lot# S644, S645, S646 and S647 were targeted 
against Aspl. Antisense oligomers LotU S648, S649, S6S0 and S6S1 were targeted 

20 against Asp2. Control antisense oligomers Lot# S6S2, S6S3, S6SS, and S674 were 
targeted against an irrelevant gene and antisense oligomers Lot #S6S6, S657, S6S8, 
and S6S9 were targeted against a second irrelevant gene. 

For transfection with the antisense oligomers, HEK125.3 cells were grown to 
about 50% confluence in 6 well plates in Minimal Essential Medium (MEM) 

25 supplemented with 10% fetal calf serum. A stock solution of oligofectin G (Sequitur 
Inc., Natick, MA) at 2 mg/ml was diluted to 50 jig/ml in serum free MEM. 
Separately, the antisense oligomer stock solution at 100 ^xM was diluted to 800 nM in 
Opti-MEM (GBCO-BRL, Grand Island, NY). The diluted stocks of oligofectin G 
and antisense oligomer were then mixed at a ratio of 1 : 1 and incubated at room 

30 temperature. After 15 minutes incubation, the reagent was diluted 10 fold into MEM 
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containing 10% fetal calf serum and 2 ml was added to each well of the 6 well plate 
after first removing the old medium. After transfection, cells were grown in the 
continual presence of the oligofectin G/antisense oligomer. To monitor Ap peptide 
release, 400 ^il of conditioned medium was removed periodically from the culture 
5 well and replaced with fresh medium beginning 24 hours after transfection. Ap 
peptides in the conditioned medium were assayed via immunoprecipitation and 
Western blotting. Data reported are from culture supematants harvested 48 hours 
after transfection. 

The 16 different antisense oligomers obtained from Sequitur Inc. were 
1 0 transfected separately into HEK 1 25.3 cells to determine their affect on Ap peptide 
processing. Only antisense oligomers targeted against Asp2 significantly reduced 
Abeta processing by HEK125.3 cells. Both Ap (1-40) and Ap (1-42) were inhibited 
by the same degree. In Table 3, percent inhibition is calculated with respect to 
untransfected cells. Antisense oligomer reagents giving greater than 50% inhibition 
15 are marked with an asterisk. For ASP2, 4 of 4 antisense oligomers gave greater than 
50% inhibition with an average inhibition of 62% for AP 1-40 processing and 60% for 
AP 1-42 processing. 
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10 



30 



TABLE 3 

Inhibition of AP peptide release from HEKl 25.3 cells treated with antisense 
oligomers. 

Gene Targeted Antisense Oligomer Abela(l-40) Abeta(l-42) 

Asp2-1 S648 71%* 67%* 

Asp2-2 S649 83%* 76%* 

Asp2-3 S650 46%* 50%* 



Asp2-4 S651 47%* 46%* 

15 Conl-1 S652 13% 18% 

Conl-2 S653 35% 30% 

Con 1-3 S655 9% 18% 

20 

Conl-4 S674 29% 18% 

Con2-l S656 12% 18% 

25 Con2-2 S657 16% 19% 

Con2-3 S658 8% 35% 



Con2-4 S659 3% 18% 

Since HEK293 cells derive from kidney, the experiment was extended to 
human IMR-32 neuroblastoma cells which express all three APP isofoms and which 
release Ap peptides into conditioned medium at measurable levels. [See Neill et ai, 
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/ NeuroSci. Res., (1994) 39: 482-93; and Asami-Odaka et al, Biochem., (1995) 
5^:10272-8.] Essentially identical results were obtained in the neuroblastoma cells as 
the HEK293 cells. As shown in Table 3B, the pair of Asp2 antisense oligomers 
reduced Asp2 mRNA by roughly one-half, while the pair of reverse control oligomers 
5 lacked this effect (Table 3B). 

Table 3B 

Reduction of AP40 and AP42 in human neuroblastoma IMR-32 cells and mouse 
neuroblastoma Neuro-2A cells treated with Asp2 antisense and control oligomers as 
10 indicated. Oligomers were transfected in quadruplicate cultures. Values tabulated are 
normalized against cultures treated with oligofectin-G™ only (mean + SD, ** 
p<0.001 compared to reverse control oligomer). 





IMR-32 cells 


Neuro-2A cells 




Asp2 
mRNA 


AP40 


Ap42 


Ap40 


AP42 


Asp2-1A 


-75% 


-49 + 2%** 


-42 + 
14%** 


-70 + 
7%** 


-67 + 2%** 


Asp2-1R 


0.16 


-0 + 3% 


21.26 


-9 + 15% 


1.05 


Asp2-2A 


-39% 


-43 + 3%** 


-44+18%** 


-61 

+12%** 


-61 +12%** 


Asp2-2R 


0.47 


12.2 


19.22 


6.15 


-8 + 10% 



20 

Together with the reduction in Asp2 mRNA there was a concomitant reduction in the 
release of Ap40 and Ap42 peptides into the conditioned medium. Thus, Asp2 
functions directly or indirectly in a human kidney and a human neuroblastoma cell 
line to facilitate the processing of APP into Ap peptides. Molecular cloning of the 
25 mouse Asp2 cDNA revealed a high degree of homology to human (>96% amino acid 
identity, see Example 3), and indeed, complete nucleotide identity at the sites targeted 
by the Asp2-1 A and Asp2-2A antisense oligomers. Similar results were obtained in 
mouse Neuro-2a cells engineered to express APP-Sw-KK. The Asp2 antisense 
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oligomers reduced release of AP peptides into the medium while the reverse control 
oligomers did not (Table 3B). Thus, the three antisense experiments with HEK293, 
llVlR-32 and Neuro-2a cells indicate, that Asp2 acts directly or indirectly to facilitate 
AP processing in both somatic and neural cell lines. 

5 

Example 8 

Demonstration of Hu-Asp2 P-Secretase Activity in Cultured Cells 

Several mutations in APP associated with early onset Alzheimer's disease 
have been shown to alter AP peptide processing. These flank the - and C-terminal 

1 0 cleavage sites that release Ap from APP. These cleavage sites are referred to as the 
P-secretase and y-secretase cleavage sites, respectively. Cleavage of APP at the 
P-secretase site creates a C-terminal fragment of APP containing 99 amino acids of 
1 1,145 daltons molecular weight. The Swedish KM-NL mutation immediately 
upstream of the P-secretase cleavage site causes a general increase in production of 

15 both the 1-40 and 1-42 amino acid forms of AP peptide. The London VF mutation 
(V71 7-F in the APP770 isoform) has little effect on total Ap peptide production, but 
appears to preferentially increase the percentage of the longer 1-42 amino acid form of 
Ap peptide by affecting the choice of P-secretase cleavage site used during APP 
processing. Thus, we sought to determine if these mutations altered the amount and 

20 type of Ap peptide produced by cultured cells cotransfected with a construct directing 
expression of Hu-Asp2. 

Two experiments were performed which demonstrate Hu-Asp2 P-secretase 
activity in cultured cells. In the first experiment, treatment of HEK 125.3 cells with 
antisense oligomers directed against Hu-Asp2 transcripts as described in Example 7 

25 was found to decrease the amount of the C-terminal fragment of APP created by 

P-secretase cleavage (CTF99) (Figure 9). This shows that Hu-Asp2 acts directly or 
indirectly to facilitate P-secretase cleavage. In the second experiment, increased 
expression of Hu-Asp2 in transfecled mouse Neuro2A cells is shown to increase 
accumulation of the CTF99 P-secretase cleavage fragment (Figure 10). This increase 

30 is seen most easily when a mutant APP-KK clone containing a C-lerminal di-lysine 
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motif is used for iransfection. A further increase is seen when Hu-Asp2 is 
cotransfecled with APP-Sw-KK containing the Swedish mutation KM -NL. The 
Swedish mutation is known to increase cleavage of APP by the P-secretase. 

A second set of experiments demonstrate Hu-Asp2 facilitates y-sccretase 
5 activity in cotransfection experiments with human embryonic kidney HEK293 cells. 
Cotransfection of Hu-Asp2 with an APP-KK clone greatly increases production and 
release of soluble Api-40 and Api -42 peptides from HEK293 cells. There is a 
proportionately greater increase in the release of Api -42. A further increase in 
production of Api-42 is seen when Hu-Asp2 is cotransfected with APP-VF (SEQ ID 

10 No. 13 [nucleotide] and SEQ ID No. 14 [amino acid]) or APP-VF-KK SEQ ID No. 19 
[nucleotide] and SEQ ID No. 20 [amino acid]) clones containing the London mutation 
V717-F. TheV717-F mutation is known to alter cleavage specificity of the APP 
Y-secretase such that the preference for cleavage at the AP42 site is increased. Thus, 
Asp2 acts directly or indirectly to facilitate y-secretase processing of APP at the P42 

15 cleavage site. 
Materials 

Antibodies 6E10 and 4G8 were purchased from Senetek (St. Louis, MO). 
Antibody 369 was obtained from the laboratory of Paul Greengard at the Rockefeller 
University. Antibody C8 was obtained from the laboratory of Dennis Selkoe at the 
20 Harvard Medical School and Brigham and Women's Hospital. 
APP Constructs used 

The APP constructs used for transfection experiments comprised the following 

APP: wild-type APP695 (SEQ ID No. 9 and No. 10) 

APP-Sw: APP695 containing the Swedish KM-NL mutation (SEQ ID No. 1 1 
25 and No. 12 , wherein the lysine (K) at residue 595 of APP695 is changed to 

asparagine (N) and the methionine (M) at residue 596 of APP695 is changed to 
leucine (L).), 

APP-VF: APP695 containing the London V-F mutation (SEQ ED Nos. 13 & 
14) (Affected residue 71 7 of the APP770 isoform corresponds with residue 642 of the 
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APP695 isoform. Thus, APP-VF as set in SEQ ID NO: 14 comprises the APP695 
sequence, wherein the vahne (V) at residue 642 is changed to phenylalanine (F).) 

APP-KK: APP695 containing a C-terminal KK motif (SEQ ID Nos. 15 & 16), 
APP-Sw-KK: APP695-SW containing a C-terminal KK motif (SEQ ID No. 1 7 

5 & 18), 

APP-VF-KK: APP695-VF containing a C-terminal KK motif (SEQ ID Nos. 
19&20). 

These were inserted into the vector pIRES-EGFP (Clontech, Palo Alto CA) 
between the Noil and BsiXl sites using appropriate linker sequences introduced by 
10 PGR. 

Transfection of antisense oligomers or plasmid DNA constructs in HEK293 cells. 
HEK1253 cells andNeuro-2A cells. 

Human embryonic kidney HEK293 cells and mouse Neuro-2a cells were 

1 5 transfected with expression constructs using the Lipofectamine Plus reagent from 

Gibco/BRL. Cells were seeded in 24 well tissue culture plates to a density of 70-80% 
confluence. Four wells per plate were transfected with 2 ^g DNA (3:1, 
APPxotransfectant), 8 ^il Plus reagent, and 4 \i\ Lipofectamine in OptiMEM. 
OptiMEM was added to a total volume of 1 ml, distributed 200 ^il per well and 

20 incubated 3 hours. Care was taken to hold constant the ratios of the two plasmids used 
for cotransfection as well as the total amount of DNA used in the transfection. The 
transfection media was replaced with DMEM, 10%FBS, NaPyruvate, with 
antibiotic/antimycotic and the cells were incubated under normal conditions (37^C, 
5% CO2) for 48 hours. The conditioned media were removed to polypropylene tubes 

25 and stored at -SOX until assayed for the content of Api-40 and Apl-42 by ElA as 
described in the preceding examples. Transfection of antisense oligomers into 
HEK 125.3 cells was as described in Example 7. 
Preparation of cell extracts. Western blot protocol 

Cells were harvested after being transfected with plasmid DNA for about 60 

30 hours. First, cells were transferred to 1 5-mI conical tube from the plate and 
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centrifuged at 1 ,500 rpm for 5 minutes to remove the medium. The ceJl pellets were 
washed once with PBS. We then lysed the cells with lysis buffer (10 mM HEPES, pH 
7.9, 1 50 mM NaCl, 1 0% glycerol, 1 mM EGTA, 1 mM EDTA, 0. 1 mM sodium 
vanadate and 1% NP-40). The lysed cell mixtures were centrifuged at 5000 rpm and 
5 the supernatant was stored at -20T as the cell extracts. Equal amounts of extracts 

from HEK125.3 cells transfected with the Asp2 antisense oligomers and controls were 
precipitated with antibody 369 that recognizes the C-tenninus of APP and then CTF99 
was detected in the immunoprecipitate with antibody 6E10. The experiment was 
repeated using C8, a second precipitating antibody that also recognizes the C-terminus 

10 of APP. For Western blot of extracts from mouse Neuro-2a cells cotransfected with 
Hu-Asp2 and APP-KK, APP-Sw-KK, APP-VF-KK or APP-VF, equal amounts of cell 
extracts were electrophoresed through 4-10% or 10-20% Tricine gradient gels 
(NOVEX, San Diego, CA). Full length APP and the CTF99 P-secretase product were 
detected with antibody 6E10. 

15 Results 

Transfection of HEK125.3 cells with Asp2-1 or Asp2-2 antisense oligomers 
reduces production of the CTF P-secretase product in comparison to cells similarly 
transfected with control oligomers having the reverse sequence (Asp2-1 reverse & 
Asp2-2 reverse), see Figure 9. Correspondingly, cotransfection of Hu-Asp2 into 

20 mouse Neuro-2a cells with the APP-KK construct increased the formation of CTF99. 
(See Fig. 10.) This was further increased if Hu-Asp2 was coexpressed with 
APP-Sw-KK, a mutant form of APP containing the Swedish KM-NL mutation that 
increases P-secretase processing. 

Effects of Asp2 on the production of Ab peptides from endogenously 

25 expressed APP isoforms were assessed in HEK293 cells transfected with a construct 
expressing Asp2 or with the empty vector after selection of transformants with the 
antibiotic G41 8. AP40 production was increased in cells transformed with the Asp2 
construct in comparison to those transformed with the empty vector DNA. AP40 
levels in conditioned medium collected from the Asp2 transformed and control 

30 cultures was 424 ± 45 pg /ml and 1 1 3 ± 58 pg/ml, respectively (p<0.001 ). Ap42 
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release was below the limit of detection by the ElA, while the release of sAPPa was 
unaffected, 112 ± 8 ng/ml versus 1 1 1 ± 40 ng/ml. This further indicates that Asp2 
acts directly or indirectly to facilitate the processing and release of Ap from 
endogenously expressed APP. 
5 Co-transfection of Hu-Asp2 with APP has little effect on AP40 production but 

increases AP42 production above background (Table 4). Addition of the di-lysine 
motif to the C-terminus of APP increases AP peptide processing about two fold, 
although AP40 and Ap42 production remain quite low (352 pg/ml and 21 pg/ml, 
respectively). Cotransfection of Asp2 with APP-KK further increases both AP40 and 

10 Ap42 production. 

The APP V71 7-F mutation has been shown to increase y-secretase processing 
at the AP42 cleavage site. Cotransfection of Hu-Asp2 with the APP-VF or 
APP-VF-KK constructs increased AP42 production (a two fold increase with APP-VF 
and a four-fold increase with APP-VF-KK, Table 4), but had mixed effects on Ap40 

1 5 production (a slight decrease with APP-VF, and a two fold increase with APP-VF-KK 
in comparison to the pcDNA cotransfection control. Thus, the effect of Asp2 on 
AP42 production was proportionately greater leading to an increase in the ratio of 
AP42/total Ab. Indeed, the ratio of Ap42/total Ap reaches a very high value of 42% in 
HEK293 cells cotransfected with Hu-Asp2 and APP-VF-KK. 

20 



25 



30 
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Table 4 

Results of coiransfecting Hu-Asp2 or pcDNA plasmid DNA with various APP 
constructs containing the V717-F mutation that modifies y-secretase processing. 
5 Cotransfection with Asp2 consistently increases the ratio of AP42/total Ap. Values 
tabulated are Ap peptide pg/ml. 



pcDNA Asp2 
Cotransfection Cotransfection 







AP40 


AP42 


Ap42/Tot 
al 


AP40 


Ap42 


Ap42/Tot 
al 


10 


















APP 


192+1 
8 


<4 


<2% 


188±40 


8±10 


3.9% 




APP-VF 


118+1 
5 


15+19 


11.5% 


85±7 


24+12 


22.4% 


15 


APP-KK 


352±2 
4 


21+6 


5.5% 


1062±101 


226±4 
9 


17.5% 




APP-VF-K 


230+3 


88±24 


27.7% 


491±35 


355±3 


42% 
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1 








6 




20 

















-68- 



wo 01/49097 



PCT/IBOl/00797 



Example 9 
Bacterial expression of human Asp2(a) 
Expression of recombinant Hu-Asp2(a) in E. coli. 

Hu-Asp2(a) can be expressed in £. coli after addition of N-terminal sequences 
5 such as a T7 tag (SEQ ID No. 21 and No. 22) or a T7 tag followed by a caspase 8 
leader sequence (SEQ E) No. 23 and No. 24). Alternatively, reduction of the GC 
content of the 5' sequence by site directed mutagenesis can be used to increase the 
yield of Hu-Asp2 (SEQ ID No. 25 and No. 26). In addition, Asp2(a) can be 
engineered with a proteolytic cleavage site (SEQ ID No. 27 and No. 28). To produce 
1 0 a soluble protein after expression and refolding, deletion of the transmembrane 
domain and cytoplasmic tail, or deletion of the membrane proximal region, 
transmembrane domain, and cytoplasmic tail is preferred. Any materials (vectors, 
host cells, etc.) and methods described herein to express Hu-Asp2(a) should in 
principle be equally effective for expression of Hu-Asp2(b). 
15 Methods 

PCR with primers containing appropriate linker sequences was used to 
assemble fiisions of Asp2(a) coding sequence with N-terminal sequence modifications 
including a T7 tag (SEQ ID Nos. 21 and 22) or a T7-caspase 8 leader (SEQ ID Nos. 
23 and 24). These constructs were cloned into the expression vector pet23a(+) 

20 [Novagen] in which a T7 promoter directs expression of a T7 tag preceding a 

sequence of multiple cloning sites. To clone Hu-Asp2 sequences behind the T7 leader 
of pet23a+, the following oligonucleotides were used for amplification of the selected 
Hu-Asp2(a) sequence: #553=GTGGATCCACCCAGCACGGCATCCGGCTG (SEQ 
ID No. 35), #554=GAAAGCTTTCATGACTCATCTGTCTGTGGAATGTTG (SEQ 

25 ED No. 36) which placed BamHI and Hindlll sites flanking the 5' and 3* ends of the 

insert, respectively. The Asp2(a) sequence was amplified from the full length Asp2(a) 
cDNA cloned into pcDNA3.I using the Advantage-GC cDNA PCR [Clontech] 
following the manufacturer's supplied protocol using annealing & extension at 68*C in 
a two-step PCR cycle for 25 cycles. The insert and vector were cut with BamHI and 

30 Hindlll, purified by electrophoresis through an agarose gel. then ligated using the 
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Rapid DNA Ligation kit [Boerhinger Mannheim]. The ligation reaction was used to 
transform the E. coli strain JM109 (Promega) and colonies were picked for the 
purification of plasmid (Qiagen,Qiaprep minispin) and DNA sequence analysis . For 
inducible expression using induction with isopropyl b-D-thiogalactopyranoside 
5 (IPTG), the expression vector was transferred into E. coli strain BL21 (Statagene). 

Bacterial cultures were grown in LB broth in the presence of ampicillin at 100 ug/ml, 
and induced in log phase growth at an OD600 of 0.6-LO with 1 mM IPTG for 4 hour 
at 37"C. The cell pellet was harvested by centrifugation. 

To clone Hu-Asp2 sequences behind the 11 tag and caspase leader (SEQ ID 
1 0 Nos. 23 and 24), the construct created above containing the T7-Hu-Asp2 sequence 

(SEQ E) Nos. 21 and 22) was opened at the BamH 1 site, and then the phosphorylated 
caspase 8 leader oligonucleotides 

#559=GATCGATGACTATCTCTGACTCTCCGCGTGAACAGGACG (SEQ ID No. 
37), #560=GATCCGTCCTGTTCACGCGGAGAGTCAGAGATAGTCATC (SEQ 

15 ID No. 38) were annealed and ligated to the vector DNA. The 5' overhang for each set 
of oligonucleotides was designed such that it allowed ligation into the BamHI site but 
not subsequent digestion with BamHI. The ligation reaction was transformed into 
JM109 as above for analysis of protein expression after transfer to E. coli strain BL21 . 
In order to reduce the GC content of the 5' terminus of asp2(a), a pair of 

20 antiparallel oligos were designed to change degenerate codon bases in 15 amino acid 
positions from QIC to A/T (SEQ ID Nos. 25 and 26). The new nucleotide sequence at 
the 5' end of asp2 did not change the encoded amino acid and was chosen to optimize 
£. Coli expression. The sequence of the sense linker is 5' 

CGGCATCCGGCTGCCCCTGCGTAGCGGTCTGGGTGGTGCTCCACTGGGTCT 
25 GCGTCTGCCCCGGGAGACCGACGAA G 3' (SEQ ID No. 39). The sequence of 
the antisense linker is : 5' 

CTTCGTCGGTCTCCCGGGGCAGACGCAGACCCAGTGGAGCACCACCCAGA 
CCGCTACGCAGGGGCAGCCGGATGCCG 3' (SEQ E> No. 40). After annealing 
the phosphorylated linkers together in 0.1 M NaCMO mM Tris, pH 7.4 they were 
30 ligated into unique Cla I and Sma I sites in Hu-Asp2 in the vector pTAC. For 
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inducible expression using induction with isopropyl b-D-thiogalactopyranoside 
(IPTG), bacterial cultures were grown in LB broth in the presence of ampicillin at 100 
ug/ml, and induced in log phase growth at an OD600 of 0.6-1 .0 with 1 mM IPTG for 
4 hour at 37*C. The cell pellet was harvested by centrifugation. 
5 To create a vector in which the leader sequences can be removed by limited 

proteolysis with caspase 8 such that this liberates a Hu-Asp2 polypeptide beginning 
with the N-temiinal sequence GSFV (SEQ ID Nos. 27 and 28), the following 
procedure was followed. Two phosphorylated oligonucleotides containing the 
caspase 8 cleavage site lETD, #571=5' 
10 GATCGATGACTATCTCTGACTCTCCGCTGGACTCTGGTATCGAAACCGACG 
(SEQ ID No. 41) and #572= 

GATCCGTCGGTTTCGATACCAGAGTCCAGCGGAGAGTCAGAGATAGTCAT 
C (SEQ ID No. 42) were annealed and ligatcd into pET23a+ that had been opened 
with BamHI. After transformation into JM109, the purified vector DNA was 

1 5 recovered and orientation of the insert was confirmed by DNA sequence analysis. 

The following oligonucleotides were used for amplification of the selected 
Hu-Asp2(a) sequence: #573=5'AAGGATCCTTTGTGGAGATGGTGGACAACCTG, 
(SEQ ID No. 43) #554=GAAAGCTTTCATGACTCATCTGTCTGTGGAATGTTG 
(SEQ ID No. 44) which placed BamHI and Hindlll sites flanking the 5' and 3* ends of 

20 the insert, respectively. The Hu-Asp2(a) sequence was amplified fi-om the full length 
Hu-Asp2(a) cDNA cloned into pcDNA3.1 using the Advantage-GC cDNA PGR 
[Clontech] following the manufacturer's supplied protocol using annealing & 
extension at 68*C in a two-step PGR cycle for 25 cycles. The insert and vector were 
cut with BamHI and Hindlll, purified by electrophoresis through an agarose gel, then 

25 ligated using the Rapid DNA Ligation kit [Boerhinger Mannheim]. The ligation 

reaction was used to transform the £. coli strain JM 1 09 [Promega] and colonies were 
picked for the purification of plasmid (Qiagen,Qiaprep minispin) and DNA sequence 
analysis . For inducible expression using induction with isopropyl 
b-D-thiogalactopyranoside (IPTG), the expression vector was transferred into E. coli 

30 strain BL21 (Statagene). Bacterial cultures were grown in LB broth in the presence of 
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ampicillin at 100 ug/ml, and induced in log phase growth at an OD600 of 0.6-1 .0 with 
1 mM IPTG for 4 hour at 37*C. The cell pellet was harvested by centrifugation. 

To assist purification, a 6-His tag can be introduced into any of the above 
constructs following the T7 leader by opening the construct at the BamHI site and 
5 then ligating in the annealed, phosphorylated oligonucleotides containing the six 

histidine sequence #565=GATCGCATCATCACCATCACCATG (SEQ ID No, 45), 
#566=GATCCATGGTGATGGTGATGATGC (SEQ ID No. 46). The 5' overhang for 
each set of oligonucleotides was designed such that it allowed ligation into the BamHI 
site but not subsequent digestion with BamHI. 

1 0 Preparation of Bacterial Pellet: 

36.34g of bacterial pellet representing 10.8L of growth was dispersed into a 
total volume of 200ml using a 20mm tissue homogenizer probe at 3000 to 5000 rpm 
in 2M KCI, O.IM Tris, 0.05M EDTA, ImM DTT. The conductivity adjusted to about 
193mMhos with water. After the pellet was dispersed, an additional amount of the 

1 5 KCI solution was added, bringing the total volume to 5O0 ml. This suspension was 
homogenized further for about 3 minutes at 5000 rpm using the same probe. The 
mixture was then passed through a Rannie high-pressure homogenizer at 10,000psi. 

In all cases, the pellet material was carried forward, while the soluble fraction 
was discarded. The resultant solution was centrifuged in a GSA rotor for 1 hour at 

20 1 2,500 rpm. The pellet was resuspended in the same solution (without the DTT) using 
the same tissue homogenizer probe at 2,000 rpm. After homogenizing for 5 minutes 
at 3000 rpm, the volume was adjusted to 500ml with the same solution, and spun for 1 
hour at 12,500 rpm. The pellet was then resuspended as before, but this time the final 
volume was adjusted to 1 .5L with the same solution prior to homogenizing for 5 

25 minutes. After centrifuging at the same speed for 30 minutes, this procedure was 
repeated. The pellet was then resuspended into about 150ml of cold water, pooling 
the pellets from the six centrifuge tubes used in the GSA rotor. The pellet has 
homogenized for 5 minutes at 3,000 rpm, volume adjusted to 250ml with cold water, 
then spun for 30 minutes. Weight of the resultant pellet was 17.75g. 
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Summary: Lysis of bacterial pellet in KCl solution, followed by centrifugation 
in a GSA rotor was used to initially prepare the pellet. The same solution was then 
used an additional three times for resuspension/homogenization. A final water 
wash/homogenization was then performed to remove excess KCl and EDTA. 
5 SolublizQtion of Recombinant Hu'Asp2(a): 

A ratio of 9-lOml/gram of pellet was utilized for solubilizing the rHuAsp2L from the 
pellet previously described. 1 7.75g of pellet was thawed, and 1 50ml of 81VI guanidine 
HCl, 5mM PME, 0.1% DEA, was added. 3M Tris was used to titrate the pH to 8.6. 
The pellet was initially resuspended into the guanidine solution using a 20 mm tissue 
1 0 homogenizer probe at 1 000 rpm. The mixture was then stirred at 4^C for 1 hour prior 
to centrifugation at 12,500 rpm for 1 hour in GSA rotor. The resultant supernatant 
was then centrifuged for 30 minutes at 40,000 x g in an SS-34 rotor. The final 
supernatant was then stored at -20°C, except for 50 ml. 

1 5 Immobilized Nickel Affinity Chromatography of Solubilized Recombinant Hu-Asp2(a): 

The following solutions were utilized: 

A) 6M Guanidine HCl, 0. IM NaP, pH 8.0, 0.OIM Tris, 5mM pME, 0.5mM 
Imidazole 

20 A*) 6M Urea, 20mM NaP, pH 6.80, 50mM NaCl 

B*) 6M Urea, 20mM NaP, pH 6.20r50mM NaCI, 12mM Imidazole 
C) 6M Urea, 20mM NaP, pH 6.80, 50mM NaCl, 300mM Imidazole 
Note: Buffers A' and C* were mixed at the appropriate ratios to give intermediate 
concentrations of Imidazole. 

25 The 50ml of solubilized material was combined with 50ml of buffer A prior to adding 
to 100- 125ml Qiagen Ni-NTA SuperFlow (pre-equilibrated with buffer A) in a 5 x 
10cm Bio-Rad econo column. This was shaken gently overnight at 4°C in the cold 
room. 

Chromatography Steps: 
30 Drained the resultant flow through. 

Washed with 50ml buffer A (collecting into flow through fraction) 
Washed with 250ml buffer A (wash 1) 
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Washed with 250ml buffer A (wash 2) 
Washed with 250m] buffer A' 
Washed with 250ml buffer B' 
Washed with 250ml buffer A' 
5 Eluted with 250ml 75mM Imidazole 

Eluted with 250ml 150mM Imidazole (150-1) 
Eluted with 250ml 150mM Imidazole (150-2) 
Eluted with 250ml 300mM Imidazole (300-1) 
Eluted with 250ml BOOmM Imidazole (300-2) 
1 0 Eluted with 250ml 300mM Imidazole (300-3) 

Chromatography Results: 

The Hu-Asp(a) eluted at 75mM Imidazole through 300mM Imidazole. The 75mM 
fraction, as well as the first 1 50mM Imidazole (1 50-1) fraction contained 
1 5 contaminating proteins as visualized on Coomassie Blue stained gels. Therefore, 
fractions 150-2 and 300-1 will be utilized for refolding experiments since they 
contained the greatest amount of protein as visualized on a Coomassie Blue stained 
gel. 

Refolding Experiments of Recombinant HU'Asp2(a): 

20 Experiment I: 

Forty ml of 1 50-2 was spiked with IM DTT, 3M Tris, pH 7.4 and DEA lo a final 
concentration of 6mM, 50mM, and 0.1% respectively. This was diluted suddenly 
(while stirring) with 200ml of (4°C) cold 20mM NaP, pH 6.8, 150mM NaCL This 
dilution gave a final Urea concentration of IM. This solution remained clear, even if 

25 allowed to set open to the air at room temperature (RT) or at 4°C . 

After setting open to the air for 4-5 hours at 4^C, this solution was then dialyzed 
overnight against 20mM NaP, pH 7.4, ISOrnM NaCl, 20% glycerol. This method 
effectively removes the urea in the solution without precipitation of the protein. 

30 
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Experiment 2: 

Some of the 150-2 eluate was concentrated 2x on an Amicon Centriprep, 10,000 
MWCO, then treated as in Experiment ] . This material also stayed in solution, with 
no visible precipitation. 
5 Experiment 3: 

89ml of the 1 50-2 eluate was spiked with IM DTT, 3M Tris, pH 7.4 and DEA to a 
final concentration of 6mM, 50mM, and 0.1% respectively. This was diluted 
suddenly (while stirring) with 445ml of (4X) cold 20mM NaP, pH 6.8, 150mM NaCl. 
This solution appeared clear, with no apparent precipitation. The solution was 

1 0 removed to RT and stirred for 1 0 minutes prior to adding MEA to a final 

concentration of 0. 1 mM. This was stirred slowly at RT for 1 hour. Cysiamine and 
CUSO4 were then added to final concentrations of ImM and 10 jiM respectively. The 
solution was stirred slowly at RT for 10 minutes prior to being moved to the 4°C cold 
room and shaken slowly overnight, open to the air. 

15 The following day, the solution (still clear, with no apparent precipitation) was 

centrifuged at 100,000 x g for 1 hour. Supematants from multiple runs were pooled, 
and the bulk of the stabilized protein was dialyzed against 20mM NaP, pH 7.4, 
150mM NaCl, 20% glycerol. After dialysis, the material was stored at -20°C. 

Some (about 10 ml) of the protein solution (still in IM Urea) was saved back 

20 for biochemical analyses, and frozen at -20^C for storage. 

Example 10 

Expression of Hu-Asp2 and Derivatives in Insect Cells 
Any materials (vectors, host cells, etc.) and methods that are useful to express 
25 Hu-Asp2(a) should in principle be equally effective for expression of Hu-Asp2(b). 
Expression by baculovims infection. 

The coding sequence of Hu-Asp2(a) and Hu-ASp2(b) and several derivatives 
were engineered for expression in insect cells using the PCR. For the fulMength 
sequence, a 5 '-sense oligonucleotide primer that modified the translation initiation site 
30 to fit the Kozak consensus sequence was paired with a 3'-antisense primer that 
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contains the natural translation termination codon in the Hu-Asp2 sequence. PCR 
amplification of the pcDNA3.1(hygro)/Hu-Asp2(a) template was used to prepare two 
derivatives of Hu-Asp2(a) or Hu-Asp(b) that delete the C-terminal transmembrane 
domain (SEQ ID Nos. 29-30 and 50-51, respectively) or delete the transmembrane 
5 domain and introduce a hexa-histidine tag at the C-lerminus (SEQ ID Nos. 31-32 and 
52-53) respectively, were also engineered using PCR. The same 5*-sense 
oligonucleotide primer described above was paired with either a 3'-antisense primer 
that (1) introduced a translation termination codon after codon 453 (SEQ ID No. 3) or 
(2) incoiporated a hexa-histidine tag followed by a translation termination codon in 

10 the PCR using pcDNA3.1(hygro)/Hu-Asp-2(a) as the template. In all cases, the PCR 
reactions were performed amplified for 15 cycles using Pwol DNA polymerase 
(Boehringer-Mannheim) as outlined by the supplier. The reaction products were 
digested to completion with BamHl and Notl and ligated to BamHl and Noil digested 
baculovirus transfer vector pVL1393 (Invitrogen). A portion of the ligations was used 

1 5 to transform competent £. coli DH5_ cells followed by antibiotic selection on 

LB-Amp. Plasmid DNA was prepared by standard alkaline lysis and banding in CsCI 
to yield the baculovirus transfer vectors pVL1393/Asp2(a), pVL1393/Asp2(a)ATM 
and pVL1393/Asp2(a)ATM(His)^. Creation of recombinant baculo viruses and 
infection of sf9 insect cells was performed using standard methods. 

20 Expression by transfection 

Transient and stable expression of Hu-Asp2(a)ATM and 
Hu-Asp2(a)ATM(His)6 High 5 insect cells was performed using the insect 
expression vector pI27V5-His. The DNA inserts from the expression plasmids vectors 
pVL1393/Asp2(a), pVL1393/Asp2(a)ATM and pVL1393/Asp2(a)ATM(His)6 were 

25 excised by double digestion with BamHl and Notl and subcloned into BamHl and 

Not] digested pIZA^5-His using standard methods. The resulting expression plasmids, 
referred to as pIZ/Hu-Asp2ATM and pIZ/Hu-Asp2ATM(His)6, were prepared as 
described above. 

For transfection. High 5 insect cells were cultured in High Five serum free 
30 medium supplemented with 10 ^g/ml gentamycin at 27 °C in sealed flasks. 
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Transfections were performed using High five cells. High five serum free media 
supplemented with 10 ^ig/ml genlamycin, and InsectinPlus liposomes (Invilrogen, 
Carlsbad, CA) using standard methods. 

For large scale transient transfections, 1 .2 x 10' high five cells were plated in a 
5 1 50 mm tissue culture dish and allowed to attach at room temperature for 1 5-30 

minutes. During the attachment time the DNA/ liposome mixture was prepared by 
mixing 6 ml of serum free media, 60 \ig Hu-Asp2(a)ATM/pIZ (+/- His) DNA and 120 
\i] of Insectin Plus and incubating at room temperature for 15 minutes. The plating 
media was removed fi-om the dish of cells and replaced with the DNA/liposome 

1 0 mixture for 4 hours at room temperature with constant rocking at 2 rpm. An 

additional 6 ml of media was added to the dish prior to incubation for 4 days at 27 
in a humid incubator. Four days post transfection the media was harvested, clarified 
by centrifugation at 500 x g, assayed for Hu-Asp2(a) expression by Western blotting. 
For stable expression, the cells were treated with 50 ^g/ml Zeocin and the surviving 

1 5 pool used to prepared clonal cells by limiting dilution followed by analysis of the 
expression level as noted above. 

Purification ofHu-Asp2(a)ATMand Hu-Asp2(a)ATM(His)^ 

Removal of the transmembrane segment from Hu-Asp2(a) resulted in the 

20 secretion of the polypeptide into the culture medium. Following protein production 

by either baculovirus infection or transfection, the conditioned medium was harvested, 
clarified by centrifugation, and dialyzed against Tris-HCl (pH 8.0). This material was 
then purified by successive chromatography by anion exchange (Tris-HCl, pH 8.0) 
followed by cation exchange chromatography (Acetate buffer at pH 4.5) using NaCl 

25 gradients. The elution profile was monitored by (1 ) Western blot analysis and (2) by 
activity assay using the peptide substrate described in Example 12. For the 
Hu-Asp2(a)ATM(His)6, the conditioned medium was dialyzed against Tris buffer (pH 
8.0) and purified by sequential chromatography on IMAC resin followed by anion 
exchange chromatography. 
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Amino-lenminal sequence analysis of the purified Hu-Asp2(a)ATM(His)j, 
protein revealed that the signal peptide had been cleaved [TQHGIRLPLR, 
corresponding to SEQ ID NO: 32, residues 22-3]. 



5 Example 11 

Expression orHu*Asp2(a) and Hu-Asp(b) in CHO cells 

The materials (vectors, host cells, etc.) and methods described herein for 
expression of Hu-Asp2(a) are intended to be equally applicable for expression of 
Hu-Asp2(b). 

10 Heterologous expression of Hu- Asp- 2 (a) in CHO-KJ cells 

The entire coding sequence of Hu-Asp2(a) was cloned into the mammalian 
expression vector pcDNA3.1(+)Hygro (Invitrogen, Carlsbad, CA) which contains the 
CMV immediate early promoter and bGH polyadenylation signal to drive over 
expression. The expression plasmid, pcDNA3.1(+)Hygro/Hu-Asp2(a), was prepared 

1 5 by alkaline lysis and banding in CsCl and completely sequenced on both strands to 
verify the integrity of the coding sequence. 

Wild-type Chinese hamster ovary cells (CHO-Kl) were obtained from the 
ATCC. The cells were maintained in monolayer cultures in a-MEM containing 10% 
FCS at 37**C in 5% COj. Two 100 mm dishes of CHO-Kl cells (60% confluent) were 

20 transfected with pcDN A3 . 1 (+)/Hygro alone (mock) or 

pcDNA3.1(+)Hygro/Hu-Asp2(a) or pcDNA3.1(+)Hygro/Hu-Asp2(b) using the 
cationic liposome DOTAP as recommended by the supplier (Roche, Indianapolis, IN). 
The cells were treated with the plasmid DNA/Iiposome mixtures for IS hours and then 
the medium replaced with growth medium containing 500 Units/ml hygromycin B. ht 

25 the case of pcDNA3.1(+)Hygro/Hu-Asp2(a) or (b) transfected CHO-Klcells, 

individual hygromycin B-resistant cells were cloned by limiting dilution. Following 
clonal expansion of the individual cell lines, expression of Hu-Asp2(a) or Hu-Asp2(b) 
protein was assessed by Western blot analysis using a polyclonal rabbit antiserum 
raised against recombinant Hu-Asp2 prepared by expression in £. colt. Near 

30 confluent dishes of each cell line were harvested by scraping into PBS and the cells 
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recovered by centrifugation. The cell pellets were resuspended in cold lysis buffer (25 
mM Tris-HCl (pH 8.0)/5 mM EDTA) containing protease inhibitors and the cells 
lysed by sonication. The soluble and membrane fractions were separated by 
centrifugation (105,000 x g, 60 min) and normalized amounts of protein from each 
5 fraction were then separated by SDS-PAGE. Following electrotransfer of the 

separated polypeptides to PVDF membranes, Hu-Asp-2(a) or Hu-Asp2(b) protein was 
detected using rabbit anti-Hu-Asp2 antiserum (I/IOOO dilution) and the 
antibody-antigen complexes were visualized using alkaline phosphatase conjugated 
goat anti-rabbit antibodies (1/2500). A specific immunoreactive protein with an 

1 0 apparent Mr value of 65 kDa was detected in pcDN A3 . 1 (+)Hygro/Hu-Asp2 

transfected cells and not mock-transfected cells. Also, the Hu-Asp2 polypeptide was 
only detected in the membrane fraction, consistent with the presence of a signal 
peptide and single transmembrane domain in the predicted sequence. Based on this 
analysis, clone #5 had the highest expression level of Hu-Asp2{a) protein and this 

1 5 production cell lines was scaled up to provide material for purification. 

Purification of recombinant Hu'Asp-2(a) firom CH0-Kl/Hu-Asp2 clone #5 

In a typical purification, clone #5 cell pellets derived from 20 150 mm dishes 
of confluent cells, were used as the starting material. The cell pellets were 
resuspended in 50 ml cold lysis buffer as described above. The cells were lysed by 

20 polytron homogenization (2 x 20 sec) and the lysate centrifuged at 338,000 x g for 20 
minutes. The membrane pellet was then resuspended in 20 ml of cold lysis buffer 
containing 50 mM p-octylglucoside followed by rocking at 4 ''C for 1 hour. The 
detergent extract was clarified by centrifugation at 338,000 x g for 20 mmutes and the 
supernatant taken for further analysis. 

25 The P-octylglucoside extract was applied to a Mono Q anion exchange 

column that was previously equilibrated with 25 mM Tris-HCl (pH 8.0)/50 mM 
P-octylglucoside. Following sample application, the column was eluted with a linear 
gradient of increasing NaCl concentration (0-1.0 M over 30 minutes) and individual 
firactions assayed by Western blot analysis and for P-secretase activity (see below). 

30 Fractions containing both Hu-Asp-2(a) immunoreactivity and P-sccreiase activity 
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were pooled and dialyzed against 25 mM NaOAc (pH 4.5)/50 mM P-octylglucoside. 
Following dialysis, precipitated material was removed by cenlrifugation and the 
soluble material chromatographed on a MonoS cation exchange column that was 
previously equilibrated in 25 mM NaOAc (pH 4.5)/ 50 mM p-octylglucoside. The 

5 column was eluted using a linear gradient of increasing NaCl concentration (0-1 .0 M 
over 30 minutes) and individual fractions assayed by Western blot analysis and for 
P-secretase activity. Fractions containing both Hu-Asp2 immunoreactivity and 
P-secretase activity were combined and determined to be >95% pure by 
SDS-PAGE/Coomassie Blue staining. 

1 0 The same methods were used to express and purify Hu-Asp2(b). 

Example 12 

Assay of Hu-Asp2 P-secretase activity using peptide substrates 

P-secretase assay 

15 Recombinant human Asp2(a) prepared in CHO cells and purified as described 

in Example 1 1 was used to assay Asp2(a) proteolytic activity directly. Activity assays 
for Asp2(a) were performed using synthetic peptide substrates containing either the 
wild-type APP P-secretase site (SEVKM 1 DAEFR; SEQ E) NO: 64), the Swedish 
KM-NL mutation (SEVNLiDAEFR; SEQ K) NO: 63), or the Ap40 and 42 

20 Y-secretase sites (RRGGVV I lA 1 TVFVGER; SEQ ID NO: 65). Reactions were 
performed in 50 nnM 2-[N-morpholino]ethane-suIfonate ("Na-MES " pH 5.5) 
containing 1 % p-oclylglucoside, 70 mM peptide substrate, and recombinant Asp2(a) 
( 1 -5 ^ig protein) for various times at 37^C. The reaction products were quantified by 
RP-HPLC using a linear gradient from 0-70 B over 30 minutes (A=0.1% TFA in 

25 water, B=0. 1 %TFA/1 0%water/90%AcCN). The elution profile was monitored by 
absorbance at 214 nm. In preliminary experiments, the two product peaks which 
eluted before the intact peptide substrate, were confirmed to have the sequence 
DAEFR (SEQ ID NO: 72)and SEVNL (SEQ ID NO: 73) using both Edman 
sequencing and MADLI-TOF mass spectrometry. Percent hydrolysis of the peptide 

30 substrate was calculated by comparing the integrated peak areas for the two product 
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peptides and the starting material derived from the absorbance at 214 nm. The 
sequence of cleavage/hydrolysis products was confirmed using Edman sequencing and 
MADLI-TOF mass spectrometry. 

The behavior of purified Asp2(a) in the proteolysis assays was consistent with 

5 the prior anti-sense studies which indicated that Asp2(a) possesses P-secrctase 

activity. Maximal proteolysis was seen with the Swedigh p-secretase peptide, which, 
after 6 hours, was about 10- fold higher than wild type APP. 

The specificity of the protease cleavage reaction was determined by 
performing the P-secretase assay in the presence of 8 pM pepstatin A and the presence 

10 of a cocktail of protease inhibitors (10 (xM leupeptin, 10 pM E64, and 5 mM EDTA). 
Proteolytic activity was insensitive to both the pepstatin and the cocktail, which are 
inhibitors of cathepsin D (and other aspartyl proteases), serine proteases, cysteinyl 
proteases, and metalloproteases, respectively. 

Hu-Asp2(b) when similarly expressed in CHO cells and purified using 

IS identical conditions for extraction with P-octylglucoside and sequential 

chromatography over Mono Q and Mono S also cleaves the Swedish P-secretase 
peptide in proteolysis assays using identical assay conditions. 

Collectively, this data establishes that both forms of Asp2 (Hu-Asp2(a) and 
Hu-Asp2(b)) act directly in cell-free assays to cleave synthetic APP peptides at the P- 

20 secretase site, and that the rate of cleavage is greatly increased by the Swedish 
KM-NL mutation that is associated with Alzheimer's disease. 

An alternative P-secretase assay utilizes internally quenched fluorescent 
substrates to monitor enzyme activity using fluorescence spectroscopy in a single 
sample or multiwell format. Each reaction contained 50 mM Na-MES (pH 5.5), 

25 peptide substrate MCA-EVKMDAEF[K-DNP] (SEQ ID NO: 7 1 ; BioSource 

International) (50 pM) and purified Hu-Asp-2 enzyme. These components were 
equilibrated to 37 "^C for various times and the reaction initiated by addition of 
substrate. Excitation was perfonned at 330 nm and the reaction kinetics were 
monitored by measuring the fluorescence emission at 390 nm. To detect compounds 

30 that modulate Hu-Asp-2 activity, the test compounds were added during the 
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preincubation phase of the reaction and the kinetics of the reaction monitored as 
described above. Activators are scored as compounds that increase the rale of 
appearance of fluorescence while inhibitors decrease the rate of appearance of 
fluorescence. 

5 It will be clear that the invention may be practiced otherwise than as 

particularly described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are possible in 
light of the above teachings and, therefore, are within the scope of the invention. The 
entire disclosure of all publications cited herein are hereby incorporated by reference. 
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What is claimed is: 

1 . A polypeptide comprising the amino acid sequence of a mammalian 
amyloid protein precursor (APP) or fragment thereof containing an APP cleavage site 
recognizable by a mammalian P-secretase, and further comprising two lysine residues 

5 at the carboxyl terminus of the amino acid sequence of the mammalian APP or APP 
fragment. 

2. A polypeptide according to claim 1 comprising the amino acid 
sequence of a mammalian amyloid protein precursor (APP), and further comprising 

1 0 two lysine residues at the carboxyl terminus of the amino acid sequence of the 
mammalian amyloid protein precursor. 

3. A polypeptide according to claim 1 wherein the polypeptide ftirlher 
includes a marker. 

15 

4. A polypeptide according to claim 3 wherein the marker comprises a 
reporter protein amino acid sequence attached to the APP amino acid sequence. 

5. • A polypeptide according to claim 4 wherein the reporter protein 
20 comprises an amino acid sequence of a fluorescing protein. 

6. A polypeptide according to claim 1 , wherein the mammalian APP is a 
human APP. 

25 7. A polypeptide according to claim 6, wherein the human APP 

comprises at least one variation selected from the group consisting of a Swedish 
KM-NL mutation and a London V71 7-F mutation. 
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8. A polypeptide according to claim 6, wherein the human APP is 
selected from the group consisting of: an APP695 isoform, an APP 751 isoform, and 
an APP770 isoform. 

5 9. A polypeptide according to claim 1 wherein the APP protein or 

fragment thereof comprises the APP-Sw P-secretase peptide sequence NLDA. 

1 0. A polypeptide according to claim 9 wherein the APP protein or 
fragment thereof comprises the APP-Sw |J-secretase peptide sequence 

10 SEVNLDAEFR (SEQ ID NO: 63). 

11. A polypeptide according to claim 9 wherein the APP protein or 
fragment thereof further includes an APP transmembrane domain carboxy-lerminal to 
the APP-Sw P-secretase peptide sequence. 

15 

12. A polypeptide according to claim 9 wherein the APP protein or 
fragment thereof comprises a chimeric APP, said chimeric APP including partial APP 
amino acid sequences from at least two species. 

20 13. A polypeptide according to claim 12 wherein the chimeric APP 

includes amino acid sequence of a human APP and a rodent APP. 

14. A polynucleotide comprising a nucleotide sequence that encodes a 
polypeptide according to any one of claims 1 . 

25 

15. A vector comprising a polynucleotide according to claim 14. 

16. A vector according to claim 1 5 wherein said polynucleotide is operably 
linked to a promoter to promote expression of the polypeptide encoded by the 

30 polynucleotide in a host cell. 



-84- 



wo 01/49097 



PCT/lBOl/00797 



1 7. A host cell iransfoimed or transfected with a polynucleotide according 
to claim 14 or a vector according to claim 1 5 or 1 6. 

1 8. A host cell according to claim 1 7 that is a manmialian cell. 

5 

19. A polypeptide useful for assaying for modulators of P-secretase 
activity, said polypeptide comprising an amino acid sequence of the formula NH2-X- 
Y-Z-KK-COOH; 

wherein X, Y, and Z each comprise an amino acid sequence of at least 
10 one amino acid; 

wherein-NHj-X comprises an amino-tcrminal amino acid sequence 
having at least one amino acid residue; 

wherein Y comprises an amino acid sequence of a P-secretase 
recognition site of a mammalian amyloid protein precursor (APP); and 
15 wherein Z-KK-COOH comprises a carboxy*terminaI amino acid 

sequence ending in two lysine (K) residues. 

20. A polypeptide according to claim 19 wherein the carboxyl-terminal 
amino acid sequence Z includes a hyrdrophobic domain that is a transmembrane 

20 domain in host cells that express the polypeptide. 

21 . A polypeptide according to claim 1 9 wherein the amino-terminal 
amino acid sequence X includes an amino acid sequence of a reporter protein. 

25 22. A polypeptide according to claim 1 9 wherein the P-secretase 

recognition site Y comprises the human APP-Sw P-secretase peptide sequence 
NLDA. 

23. A polynucleotide comprising a nucleotide sequence that encodes a 
30 polypeptide according to any one of claims 1 9-22. 
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24. A purified polypeptide comprising the murine Asp2 amino acid 
sequence set forth in SEQ ID NO: 8, or a fragment thereof that retains the p-secretase 
activity of said murine Asp2. 

5 25. A polynucleotide comprising a nucleotide sequence lhat encodes the 

polypeptide of claim 24. 

26. A polynucleotide according to claim 25 comprising the nucleotide 
sequence set forth in SEQ ED NO: 7. 

10 

27. A purified murine Asp2(b) polypeptide comprising the amino acid 
sequence set for in SEQ ID NO: 8 from residues M89 and 215-501, but lacking 
residues 190-214. 

15 28. A polynucleotide comprising a nucleotide sequence that encodes the 

murine Asp2(b) polypeptide according to claim 27. 

29. A vector comprising a polynucleotide according to claim 25. 

20 30. A vector according to claim 29 wherein said polynucleotide is operably 

linked to a promoter to promote expression of the polypeptide encoded by the 
polynucleotide in a host cell. 

31 . A host cell transformed or transfected with a vector according to claim 

25 30. 

32. A host cell according to claim 3 1 that is a mammalian cell. 

33. A host cell according to claim 3 1 that expresses the polypeptide on its 
30 surface. 
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5 



15 



20 



34. A host cell according to claim 3 1 , wherein the host cell is transfected 
with a nucleic acid comprising a nucleotide sequence that encodes an amyloid 
precursor protein (APP) that includes two carboxy-terminal lysine residues. 



35. A host cell according to claim 34 that expresses the polypeptide and 
the APP on its surface. 

36. A method of making a murine Asp2 polypeptide comprising steps of 
10 culturing a host cell of claim 61 in a culture medium under conditions in which the 

cell produces the polypeptide that is encoded by the polynucleotide. 

37. A method according to claim 36, further comprising a step of purifying 
the polypeptide from the cell or the culture medium. 



38. A host cell transformed or transfected with a polynucleotide according 
to claim 25. 

39. A host cell according to claim 38 that is a mammalian cell. 

40. A host cell according to claim 38 that expresses the polypeptide on its 
surface. 



41 . A host cell according to claim 38, wherein the host cell is transfected 
25 with a nucleic acid comprising a nucleotide sequence that encodes an amyloid 

precursor protein (APP) or fragment thereof containing a P-secretase cleavage site. 

42. A host cell according to claim 41 wherein the APP includes two 
carboxy-terminal lysine residues. 

30 
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43. A host cell according to claim 41 wherein the APP comprises the 
Swedish mutation (K— ►N, M— >L) adjacent to the P-secretase cleavage site. 

44. A host cell according to claim 41 that expresses the polypeptide and 
the APP on its surface. 

45. A method of making a murine Asp2 polypeptide comprising steps of 
culturing a host cell of claim 38 in a culture medium under conditions in which the 
cell produces the polypeptide that is encoded by the polynucleotide. 

46. A method according to claim 45, further comprising a step of purifying 
the polypeptide from the cell or the culture medium. - 



47. A purified polypeptide comprising a fragment of a mammalian Asp2 
15 protein, wherein said polypeptide lacks the Asp2 transmembrane domain of said Asp2 

protein, and wherein the polypeptide and the fragment retain the p-secretase activity 
of said mammalian Asp2 protein. 

48. A purified polypeptide according to claim 47 comprising a fragment of 
20 a human Asp2 protein that retains the p-secretase activity of said human Asp2 protein. 

49. A purified polypeptide according to claim 48, wherein said polypeptide 
comprises a fragment of Asp2(a) having the amino acid sequence set forth in SEQ ID 
NO: 4, and wherein said polypeptide lacks transmembrane domain amino acids 455 to 

25 477 ofSEQIDNO:4. 

50. A purified polypeptide according to claim 49, wherein said polypeptide 
ftirther lacks cytoplasmic domain amino acids 478 to 501 of SEQ ID NO: 4. 
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51. A purified polypeptide according to claim 50, wherein said polypeptide 
further lacks amino acids 420-454 of SEQ ID NO: 4. 

52. A purified polypeptide according to any one of claims 48-5 1 , wherein 
5 said polypeptide comprises an amino acid sequence: 

that includes amino acids 58 to 419 of SEQ ID NO: 4, and 
that lacks amino acids 22 to 57 of SEQ ID NO: 4. 

53. A purified polypeptide according to any one of claims 48-5 1 , wherein 
1 0 said polypeptide comprises an amino acid sequence: 

that includes amino acids 46 to 419 of SEQ ID NO: 4, and 
that lacks amino acids 22 to 45 of SEQ ID NO: 4. 

54. A purified polypeptide according to claim 49, wherein said polypeptide 
15 comprises an amino acid sequence that includes amino acids 22 to 454 of SEQ ID 

NO: 4. 

55. A purified polypeptide according to claim 47 comprising the amino 
acid sequence of human Asp-2(b) protein set forth as SEQ ID NO: 6, or fragments 

• 20 thereof that retain P-secretase activity. 

56. A purified polypeptide according to claim 48, wherein said polypeptide 
comprises a fragment of Asp2(b) having the amino acid sequence set forth in SEQ ID 
NO: 6, and wherein said polypeptide lacks transmembrane domain amino acids 430 to 

25 452ofSEQIDNO:6. 

57. A purified polypeptide according to claim 56, wherein said polypeptide 
further lacks cytoplasmic domain amino acids 453 to 476 of SEQ ID NO: 6. 
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58. A purified polypeptide according to claim 57, wherein said polypeptide 
further lacks amino acids 395-429 of SEQ ID NO: 4. 

59. A purified polypeptide according to any one of claims 56-58, wherein 
5 said polypeptide comprises an amino acid sequence: 

that includes amino acids 58 to 394 of SEQ ID NO: 4, and 
that lacks amino acids 22 to 57 of SEQ ID NO: 4. 

60. A purified polypeptide according to any one of claims 56-58, wherein 
10 said polypeptide comprises an amino acid sequence: 

that includes amino acids 46 to 394 of SEQ ID NO: 4, and 
that lacks amino acids 22 to 45 of SEQ ID NO: 4. 

61 . A purified polypeptide according to claim 56, wherein said polypeptide 
1 5 comprises an amino acid sequence that includes amino acids 22 to 429 of SEQ ID 

NO: 6. 

62. A polypeptide comprising an amino acid sequence at least 95% 
identical to a fragment of a human Asp2 protein, wherein said polypeptide and said 

20 fragment lack a transmembrane domain and retain P-secretase activity of the human 
Asp2 protein. 

63. A purified polynucleotide comprising a nucleotide sequence that 
encodes the polypeptide of any one of claims 47-63. 

25 

64. A polynucleotide of claim 47 wherein the polypeptide comprises a 
fragment of human Asp2 protein. 

65. A polynucleotide of claim 64 wherein the polypeptide comprises a 
30 fragment of Asp2(a) having the amino acid sequence set forth as SEQ ED NO: 4, and 
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wherein the polypeptide lacks the transmembrane domain amino acids 455-477 of 
SEQ ID NO: 4 

66. A polynucleotide of claim 64, wherein the polypeptide fiirther lacks 
5 cytoplasmic domain amino acids 478-501 of SEQ ID NO: 4. 

67. A purified polynucleotide of claim 66, wherein said polypeptide further 
lacks amino acids 420-454 of SEQ ID NO: 4. 

10 68. A polynucleotide of claim 65, wherein the polypeptide comprises an 

amino acid sequence: 

that includes amino acids 58-419 of SEQ ID NO: 4, and 
that lacks amino acids 22-57 of SEQ ID NO: 4. 

15 69. A polynucleotide of claim 65, wherein the polypeptide comprises an 

amino acid sequence: 

that includes amino acids 46-419 of SEQ ID NO: 4, and 
that lacks amino acids 22-45 of SEQ ID NO: 4. 

20 70. A polynucleotide of claim 65, wherein the polypeptide comprises an 

amino acid sequence that includes amino acids 22-454 of SEQ ID NO: 4. 

71 . A polynucleotide of claim 64, wherein the polypeptide comprises a 
fragment of human Asp2(b) having the amino acid set forth in SEQ ID NO: 6, and 

25 wherein the polypeptide lacks transmembrane domain amino acids 430-452 of SEQ 
ID NO: 6. 

72. A polynucleotide of claim 7 1 , wherein the polypeptide lacks 
cytoplasmic domain amino acids 453-476 of SEQ ID NO: 6. 

30 
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73. A polynucleotide of claim 72, wherein the polypeptide further lacks 
amino acids 395-429 of SEQ ID NO: 6. 

74. A polynucleotide ofclaim 71, wherein the polypeptide comprises an 
5 amino acid sequence: 

that includes amino acids 58-394 of SEQ ID NO: 6, and 
that lacks amino acids 22 to 57 of SEQ ED NO: 6. 

75. A polynucleotide ofclaim 71, wherein the polypeptide comprises an 
10 amino acid sequence: 

that includes amino acids 46-394 of SEQ ID NO: 6, and 
that lacks amino acids 22-45 of SEQ ID NO: 6. 

76. A polynucleotide ofclaim 71 , wherein the polypeptide comprises an 
1 5 amino acid sequence that includes amino acids 22 to 429 of SEQ ID NO: 6. 



77. A vector comprising a polynucleotide according to claim 63. 

20 78. A host cell transformed or transfected with a polynucleotide according 

to claim 63. 

79. A host cell transformed or transfected with a vector of claim 77. 

25 80. A polynucleotide comprising a nucleotide sequence that hybridizes 

under stringent conditions to a nucleic acid comprising the sequence set forth in SEQ 
ID NO: 4 or SEQ ID NO: 6, wherein the nucleotide sequence encodes a polypeptide 
having P-secretase biological activity. 

30 8 1 . A vector comprising a polynucleotide of claim 80. 
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82. A host cell transformed or transfected with a pol>Tiucleotide of claim 

80. 

83. A method for assaying for modulators of P-secretase activity, 
5 comprising the steps of: 

(a) contacting a first composition with a second composition both in the 
presence and in the absence of a putative modulator compound, wherein the first 
composition comprises a mammalian P-secretase polypeptide or biologically active 
fragment thereof, and wherein the second composition comprises a substrate 

10 polypeptide having an amino acid sequence comprising a P-secretase cleavage site; 

(b) measuring cleavage of the substrate polypeptide in the presence and in 
the absence of the putative modulator compound; and 

(c) identifying modulators of P-secretase activity from a difference in 
cleavage in the presence versus in the absence of the putative modulator compound, 

IS wherein a modulator that is a P-secretase antagonist reduces such cleavage and a 
modulator that is a P*secretase agonist increases such cleavage. 

84. A method according to claim 83, wherein the first composition 
comprises a purified human Asp2 polypeptide. 

20 

85. A method according to claim 83, wherein the first composition 
comprises a soluble fragment of a human Asp2 polypeptide that retains Asp2 P- 
secretase activity. 

25 86. A method according to claim 85 wherein the soluble fi-agment is a 

fragment lacking an Asp2 transmembrane domain. 

87. A method according to claim 83, wherein the substrate polypeptide of 
the second composition comprises the amino acid sequence SEVNLDAEFR. 

30 
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88. A method according to claim 83, wherein the substrate polypeptide of 
the second composition comprises the amino acid sequence EVKMDAEF. 

89. A method according to claim 83, wherein the second composition 
5 comprises a polypeptide having an amino acid sequence of a human amyloid 

precursor protein (APP). 

90. A method according to claim 89, wherein the human amyloid precursor 
protein is selected from the group consisting of: APP695, APP751, and APP770. 

10 

91 . A method according to claim 90, wherein the human amyloid precursor 
protein includes at least on mutation selected from a KM-*NL Swiss mutation and a 
V-F London mutation. 

IS 92. A method according to claim 89, wherein the polypeptide having an 

amino acid sequence of a human APP ftirther comprises an amino acid sequence 
comprising a marker sequence attached amino-terminal to the amino acid sequence of 
the human amyloid precursor protein. 



20 93. A method according to claim 89, wherein the polypeptide having an 

amino acid sequence of a human APP fiirther comprises two lysine residues attached 
to the carboxyl terminus of the amino acid sequence of the human APP. 

94. A method according to claim 82, wherein the second composition 
25 comprises a eukaryotic cell that expresses amyloid precursor protein (APP) or a 

fragment thereof containing a p-secretase cleavage site. 

95. A method according to claim 94, wherein the APP expressed by the 
host cell is an APP variant that includes two carboxyl-terminal lysine residues. 

30 
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96. A method for identifying agents thai inhibit the activity of human Asp2 
aspartyl protease (Hu-Asp2), comprising the steps of: 

(a) contacting amyloid precursor protein (APP) and purified and isolated 
Hu-Asp2 in the presence and absence of a test agent; 
5 (b) determining the APP processing activity of the Hu-Asp2 in the 

presence and absence of the test agent; and 



(c) comparing the APP processing activity of the Hu-Asp2 polypeptide in 
the presence of the test agent to the activity in the absence of the test agent to identify 
10 an agent that inhibits the APP processing activity of Hu-Asp2, wherein reduced 
activity in the presence of the test agent identifies an agent that inhibits Hu-Asp2 
activity. 



97. A method according to claim 96, wherein the Hu-Asp2 comprises a 
IS polypeptide purified and isolated from a cell transformed or transfected with a 
polynucleotide comprising a nucleotide sequence that encodes the Hu-Asp2. 



98. A method according to claim 60 wherein the nucleotide sequence is 
selected from the group consisting of: 
20 (a) a nucleotide sequence encoding the Hu-Asp2{a) amino acid sequence 

set forth in SEQ ID NO: 4; 

(b) a nucleotide sequence encoding the Hu-Asp2(b) amino acid sequence 
set forth in SEQ ID NO: 6; 

(c) a nucleotide sequence encoding a fragment of Hu-Asp2(a) (SEQ ID 
25 NO: 4) or Hu-Asp2(b) (SEQ ID NO: 6), wherein said fragment exhibits aspartyl 

protease activity characteristic of Hu-Asp2(a) or Hu-Asp2(b); and 

(d) a nucleotide sequence of a polynucleotide that hybridizes under 
stringent hybridization conditions to the complement of a Hu-Asp2-encoding 
polynucleotide selected from the group consisting of SEQ ED NO: 3 and SEQ ID NO: 

30 5. 
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99. A method according to claim 97 wherein the Hu-Asp2 comprises the 
Hu-Asp2(a) amino acid sequence set forth in SEQ ID NO: 4. 

5 100. A method according to claim 97, wherein the Hu-Asp2 comprises the 

Hu-Asp2(b) amino acid sequence set forth in SEQ ID NO: 6. 

101 . A method according to claim 97, wherein the Hu-Asp2 comprises a 
fragment of Hu-Asp2(a) (SEQ ID NO: 4) or Hu-Asp2(b) (SEQ ID NO: 6), wherein 

10 said fragment exhibits aspartyl protease activity characteristic of Hu-Asp2(a) or Hu- 
Asp2(b). 

102. A method according to claim 96. wherein the APP comprises the 
Swedish mutation (K"^N, M-^L) adjacent to the p-secretase processing site. 

15 

103. A method according to claim 96, further comprising a step of treating 
Alzheimer's Disease with an agent identified as an inhibitor of Hu-Asp2 according to 
steps (a)-(c). 

20 1 04. A method for identifying agents that inhibit the activity of human Asp2 

aspartyl protease (Hu-Asp2), comprising the steps of: 

(a) contacting Hu-Asp2 and amyloid precursor protein (APP) in the 
presence and absence of a test agent, wherein the APP comprises a carboxy- 
terminal di-lysine (KK) and wherein the contacting comprises growing a host 

25 cell that expresses the APP in the presence and absence of the test agent; 

(b) determining the APP processing activity of the Hu-Asp2 in the 
presence and absence of the test agent; and 

(c) comparing the APP processing activity of the Hu-Asp2 polypeptide in 
the presence of the test agent to the activity in the absence of the test agent to 

30 identify an agent that inhibits the activity of Hu-Asp2, wherein reduced 
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activity in the presence of the test agent identifies an agent that inhibits Hu- 
Asp2 activity. 

1 05. A method according to claim 1 04, wherein the APP further comprises 
5 the Swedish mutation (K"^N, M"^L) adjacent to the P-secretase processing site. 

106. A method according to claim 104, wherein the host cell has been 
transformed or transfected with a polynucleotide comprising a nucleotide sequence 
that encodes a Hu-Asp2, wherein said nucleotide sequence is selected from the group 

10 consisting of: 

(a) a nucleotide sequence encoding the Hu-Asp2{a) amino acid sequence 
set forth in SEQ ID NO: 4; 

(b) a nucleotide sequence encoding the Hu-Asp2(b) amino acid sequence 
set forth in SEQ ID NO: 6; 

15 (c) a nucleotide sequence encoding a fragment of Hu-Asp2(a) (SEQ ID 

NO: 4) or Hu-Asp2(b) (SEQ ID NO: 6), wherein said fragment exhibits aspartyl 
protease activity characteristic of Hu-Asp2(a) or Hu-Asp2(b); and 

(d) a nucleotide sequence of a polynucleotide that hybridizes under 
stringent hybridization conditions to the complement of a Hu-Asp2-encoding 

20 polynucleotide selected from the group consisting of SEQ ID NO: 3 and SEQ ID NO: 
5. 

107. A method according to claim 104, further comprising a step of treating 
Alzheimer's Disease with an agent identified as an inhibitor of Hu-Asp2 according to 

25 steps (a)-(c). 

1 08. A method for identifying agents that inhibit the activity of human Asp2 
aspartyl protease (Hu-Asp2), comprising the steps of: 

(a) contacting Hu-Asp2 and amyloid precursor protein (APP) in the 
30 presence and absence of a test agent, wherein the contacting comprises 
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growing a host cell transformed or transfected with a polynucleotide 
comprising a nucleotide sequence encoding the Hu-Asp2 in the presence and 
absence of the test agent; 

(b) determining the APP processing activity of the Hu-Asp2 in the 
5 presence and absence of the test agent; and 

(c) comparing the APP processing activity of the Hu-Asp2 polypeptide in 
the presence of the test agent to the activity in the absence of the lest agent to 
identify an agent that inhibits the activity of Hu-Asp2, wherein reduced 
activity in the presence of the test agent identifies an agent that inhibits Hu- 

10 Asp2 activity. 

109. A method according to claim 108, wherein the host cell expresses APP. 

110. A method according to claim 109 wherein the determining step 
1 5 comprises measuring the production of amyloid beta peptide by the cell in the 

presence and absence of the test agent. 

111. A method according to claim 1 09, wherein the host cell expresses an 
APP having an amino acid sequence that includes a carboxy-terminal di-lysine. 

20 

112. A method according to claim 109, wherein the host cell expresses an 
APP comprising the Swedish mutation (K-^N, M-^L) adjacent to the P-secretase 
processing site. 

25 1 13. A method according to claim 1 08, wherein the host cell is a human 

embryonic kidney cell line 293 (HEK293) cell. 
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114. A method according to claim 108 wherein the nucleotide sequence is 
selected from the group consisting of: 

(a) a nucleotide sequence encoding the Hu-Asp2(a) amino acid sequence 
set forth in SEQ ID NO: 4; 
5 (b) a nucleotide sequence encoding the Hu-Asp2(b) amino acid sequence 

set forth in SEQ ID NO: 6; 

(c) a nucleotide sequence encoding a fragment of Hu-Asp2(a) (SEQ ID 
NO: 4) or Hu-Asp2(b) (SEQ ID NO: 6), wherein said fragment exhibits aspartyl 
protease activity characteristic of Hu-Asp2(a) or Hu-Asp2(b); and 
10 (d) a nucleotide sequence of a polynucleotide that hybridizes under 

stringent hybridization conditions to the complement of a Hu-Asp2-encoding 
polynucleotide selected from the group consisting of SEQ ID NO: 3 and SEQ ID NO: 
5. 

IS lis. A method according to claim 108, wherein the host cell comprises a 

vector that comprises the polynucleotide. 

116. A method according to claim 108 wherein the polynucleotide 
comprises a nucleotide sequence encoding the Hu-Asp2(a) amino acid sequence set 

20 forth in SEQ ID NO: 4. 

117. A method according to claim 1 08 wherein the polynucleotide 
comprises a nucleotide sequence encoding the Hu-Asp2(b) amino acid sequence set 
forth in SEQ ID NO: 6. 

2S 

118. A method according to claim 108 wherein the polynucleotide 
comprises a nucleotide sequence encoding a polypeptide comprising a fragment of 
Hu-Asp2(a) (SEQ ID NO: 4) or Hu-Asp2(b) (SEQ ID NO: 6), wherein said fragment 
exhibits aspartyl protease activity characteristic of Hu-Asp2(a) or Hu-Asp2(b). 

30 
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119. A method according to claim 108 wherein the Hu-Asp2 is encoded by 
a nucleotide sequence of a polynucleotide that hybridizes under stringent 
hybridization conditions to the complement of a Hu-Asp2-encoding polynucleotide 
selected from the group consisting of SEQ E)NO: 3 and SEQ ED NO: 5. 

5 

120. A method according to claim 108, further comprising a step of treating 
Alzheimer's Disease with an agent identified as an inhibitor of Hu-Asp2 according to 
steps (a)-(c). 

10 121. A method for identifying agents that modulate the activity of Asp2 

aspartyl protease, comprising the steps of: 

(a) contacting an Asp2 aspartyl protease and amyloid precursor protein 
(APP) in the presence and absence of a test agent, wherein the Asp2 aspartyl protease 
is encoded by a nucleic acid molecule that hybridizes under stringent hybridization 

1 5 conditions to the complement of a Hu-Asp2-encoding polynucleotide selected from 
the group consisting of SEQ ID NO: 3 and SEQ ID NO: 5; 

(b) determining the APP processing activity of the Asp2 aspartyl protease 
in the presence and absence of the test agent; and 

(c) comparing the APP processing activity of the A5p2 aspartyl protease in 
20 the presence of the test agent to the activity in the absence of the agent to identify 

agents that modulate the activity of the Asp2 aspartyl protease, wherein a modulator 
that is an Asp2 inhibitor reduces APP processing and a modulator that is an Asp2 
agonist increases such processing. 

25 1 22. A method according to claim 121, wherein the Asp2 aspartyl protease 

is purified and isolated. 

1 23. A method according to claim 121, further comprising a step of treating 
Alzheimer's Disease with an agent identified as an inhibitor of Hu-Asp2 according to 
30 steps (a)-(c). 
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124. A method for identifying an agent that inhibits APP processing activity 
of human Asp2 aspartyl protease, comprising steps of: 

(a) contacting Hu-Asp2 with an APP substrate for the Hu-Asp2, in the 
presence and absence of a test agent; 
5 (b) determining the proteolytic processing of the APP substrate by the Hu- 

Asp2 in the presence and absence of the test agent; and 

(c) comparing the proteolytic processing of the APP substrate by the Hu- 
Asp2 in the presence and absence of the test agent to identify an agent that inhibits the 
APP processing activity of Hu-Asp2, wherein reduced proteolytic processing of the 
10 APP substrate by the Hu-Asp2 in the presence of the test agent identifies an agent that 
inhibits Hu-Asp2 activity. 

125. A method according to claim 124, wherein the APP substrate is a 
peptide comprising a P-secretase cleavage site of APP. 

15 

126. A method according to claim 125, wherein the P-secretase cleavage 
site comprises the formula P2-Pl-Pr-P2*, wherein 

P2 is an amino acid selected from K and N; 
PI is an amino acid selected from M and L; 
20 P r is the amino acid D; and 

P2* is the amino acid A. 

127. A method according to claim 125, wherein the peptide comprises the 
amino acid sequence KMDA (SEQ ID NO: 64, positions 4-7). 

25 

1 28. A method according to claim 1 26, wherein the peptide comprises the 
amino acid sequence EVKMDAEF (SEQ ED NO: 67). 

129. A method according to claim 125, wherein the peptide comprises the 
30 amino acid sequence NLDA (SEQ ID NO: 66). 
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1 30. A method of reducing cellular production of amyloid beta (AP) from 
amyloid precursor protein (APP), comprising step of transforming or transfecting cells 
with an anti-sense reagent capable of reducing Asp2 polypeptide production by 
5 reducing Asp2 transcription or translation in the cells, wherein reduced Asp2 

polypeptide production in the cells correlates with reduced cellular processing of APP 
into Ap. 



131. A method according to claim 1 30, wherein the cell is a neural cell. 

10 

1 32. A method according to claim 1 30, wherein the anti-sense reagent 
comprises an oligonucleotide comprising a single stranded nucleic acid sequence 
capable of binding to a Hu-Asp mRNA. 



1 5 1 33. A method according to claim 1 30, wherein the anti-sense reagent 

comprises an oligonucleotide comprising a single stranded nucleic acid sequence 
capable of binding to a Hu-Asp DNA. 



1 34. A method of reducing cellular production of amyloid beta (AP) from 
20 amyloid precursor protein (APP), comprising steps of: 

(a) identifying mammalian cells that produce Ap; and 

(b) transforming or transfecting the cells with an anti-sense reagent 
capable of reducing Asp2 polypeptide production by reducing Asp2 
transcription or translation in the cells, wherein reduced Asp2 polypeptide 

25 production in the cells correlates with reduced cellular processing of APP into 

Ap. 

1 35, A method according to claim 1 34, wherein the cell is a neural cell. 
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136. A method according to claim 134, wherein the anti-sense reagent 
comprises an oligonucleotide comprising a single stranded nucleic acid sequence 
capable of binding to a Hu-Asp mRNA. 



5 1 37. A method according to claim 1 33, wherein the anti-sense reagent 

comprises an oligonucleotide comprising a single stranded nucleic acid sequence 
capable of binding to a Hu-Asp DNA. 



138. A method according to claim 1 33, wherein the identifying step 
1 0 comprises diagnosing Alzheimer's disease, where Alzheimer's disease correlates with 
the existence of cells that produce AP that forms amyloid plaques in the brain. 



1 39. A vector comprising a polynucleotide according to claim 22. 



15 1 40. A host cell comprising a vector according to claim 1 39. 



141 . A purified polypeptide comprising the amino acid sequence set forth in 
SEQIDNO: 8. 



20 142. A polypeptide comprising an amino acid sequence at least 95% 

identical to a polypeptide according to any one of claims 42-61 , wherein said 
polypeptide lacks a transmembrane domain and retains p-secretase activity of a human 
Asp2 protein. 



25 1 43. A method according to claim 83, wherein the first composition 

comprises a human Asp2 polypeptide of any one of claims 1-13,1 9-24, 26-27 or 47- 
62. 
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144. A method according to claim 124 wherein the Hu-Asp2 is purified and 
isolated. 

145. A method according to claim 124, wherein the Hu-Asp2 is encoded by 
5 a nucleic acid that hybridizes under stringent hybridization conditions to the 

complement of a Hu-Asp2-encoding polynucleotide selected from the group 
consisting of SEQ ID NO: 3 and SEQ ID NO: 5. 

146. A method according to claim 1 24, wherein the Hu-Asp2 is selected 
1 0 from the group consisting of: 

(a) Hu-Asp2(a) comprising the amino acid sequence set forth in SEQ ID 

NO: 4; 

(b) Hu-Asp2(b) comprising the amino acid sequence set forth in SEQ ID 
NO: 6; and 

15 (c) fragments of Hu-Asp2(a) (SEQ ID NO: 4) and Hu-Asp2(b) (SEQ ID 

NO: 6) that cleave the APP substrate at a P-secretase cleavage site. 

147. A method according to claim 87, wherein the Hu-Asp2 comprises an 
amino acid sequence at least 95% identical to an amino acid sequence selected from 

20 the group consisting of SEQ ID NOS: 4 and 6. 

148. A method according to claim 146, wherein the Hu-Asp2 comprises a 
soluble fragment of Hu-Asp2(a) or Hu-Asp2(b) that lacks an Asp2 transmembrane 
domain. 



25 



149. A method according to claim 1 48, wherein the Hu-Asp2 has an amino 
acid sequence consisting of a sequence-selected from the group consisting of SEQ ID 
NOS: 30. 32, 51, and 53. 
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1 50. A method according to claim 148, wherein the Hu-Asp2 comprises a 
fragment of Hu-Asp2(a) or Hu-Asp2(b), wherein the Hu-Asp 2 lacks amino acids 1-45 
ofSEQIDNOS:4or 6. 

5 
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FIGURE XA 



ATGGGCGCACTGGCCCGG GCGCTGCTGCTG CCTCTGCTGGCC CAGTGGCTCCTG CGCGCC 
MGALARALLLPLLAQWLLRA 
CCCCGG AGCTGGCCCCCG CGCCCTTCACGC TGCCCCTCCGGG TGGCCGCGGCCA CGAAC 
APELAPAPFTLPLRVAAATN 
CGCGTAGTTGCGCCCACC CCGGGACCCGGG ACCCCTGCCGAG CGCCACGCCGAC GGCT7G 
RVVAPT FGPGTPAERHADGL 
GCGCTCGCCCTGGAGCCT GCCCTGGCGTCC CCCGCC-GGCGCC GCCAACTTCTTG GCCATG 
ALALEPALAS PAGAANFLAH 
GTAGAC AACCTGCAGGGG GACTCTGGCCGC GGCTACTACCTG GAGATGCTGATC GGGACC 
VDNLQGDSGRGYYLEMLI GT 

CCCCCG CAGAAGCTACAG ATTCTCGTTGAC ACTGGAAGCAGT AACTTTGCCGTG GC AGGA 
PPOKLQ ILVDTGSSNFAVAG 

ACCCCG CACTCCTACATA GACACGTACTTT GACACAGAGAGG TCTAGCACATAC CGCTCC 
TPHSYI DTYFDTERSSTYRS 

AAGGGC TTTGACGTCACA GTGAAGTACACA CAAGGAAGCTGG ACGGGCTTCGTT GGGGAA 
KG FDVT VKYTQGSWTGFVGE 

GACCTCGTCACCATCCCC AAAGGCTTCAAT ACTTCTTTTCTT GTCAACATTGCC ACTATT 
DLVTIP KGFNT SFLVNIATI 

TTTGAATCAGAGAATTTC TTTTTGCCTGGG ATTAAATGGAAT GGAATACTTGGC CTAGCT 
FESENF FLPG IKWNGILGLA 

TATGCC ACACTTGCCAAG CCATCAAGTTCT CTGGAGACCTTC TTCGACTCCCTG GTGACA 
YATLAK PSSS LETF FDSLVT 

CAAGCA AACATCCCCAAC GTTTTCTCCATG CAGATGTGTGGA GCCGGCTTGCCC GTTGCT 
QANIPN VFS MQMCGAGLP VA 

GGATCTGGGACCAACGGA GGTAGTCTTGTC TTGGGTGGAATT GAACCAAGTTTG TATAAA 
GSGTNGGSLVLGGI EPSLYK 

GGAGAC ATCTGGTATACC CCTATTAAGGAA GAGTGGTACTAC CAGATAGAAATT CTGAAA 
GDIWYT PIKEEWYYQIEI LK 

TTGGAA ATTGGAGGCCAA AGCCTTAATCTG GACTGCAGAGAG TATAACGCAGAC AAGGCC 
LE IGGO SLNLDCREYNADKA 

ATCGTGGACAGTGGCACC ACGCTGCTGCGC CTGCCCCAGAAG GTGTTTGATGCG GTGGTG 
IVDSGTTLLRLPQKVFDAVV 

GAAGCTGTGGCCCGCGCA TCTCTGATTCCA GAATTCTCTGAT GGTTTCTGGACT GGGTCC 
EAVARASLIPEFSDGFWTGS 

CAGCTGGCGTGCTGGACG AATTCGGAAACA CCTTGGTCTTAC TTCCCTA/AATC TCCATC 
QLACWTNSETPWSYFPKISI 

TACCTG AGAGATGAGAAC TCCAGCAGGTCA TTCCGTATCACA ATCCTGCCTCAG CTTTAC 
YLRDEN SSRS FRIT ILPQLY 

ATTCAG CCCATGATGGGG GCCGGCCT6AAT TATGAATGTTAC CGATTCGGCATT TCCCCA 
IQPMMGAGLNYECYRFGI SP 

TCCACA AATGCGCTGGTG ATCGGTGCCACG GTGATGGAGGGC TTCTACGTCATC TTCGAC 
STNALV IGAT VMEG FYVI FD 



AGAGCC CAGAAGAGGGTG GGCTTCGCAGCG AGCCCCTGTGCA GAAATTGCAGGT GCTGCA 
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FIGURE IB 



RAQKRVGFAASPCAEIAGAA 

GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 
VSBISG PFSTEDVASNCVPA 

CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 
QSLSEPI LWIVSYALMSVCG 

GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 
AILLVLIVLLLLPFRCQRRP 

CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 
RDPEVVNDESSLVRHRWK 

GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 
AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 
GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 
CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCTVAAAAAAAAAA 
AAAA 
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FIGURE 2A 



ATGGCCCAAGCCCTGCCC TGGCTCCTGCTG TGGATGGGCGCG GGAGTGCTGCCT GCCCAC 
MAOALP WLLLWMGAGVLPAH 

GGCACCCAGCACGGCATC CGGCTGCCCCTG CGCAGCGGCCTG GGGGGCGCCCCC CTGGGG 
GTQHGI RLPLRSGLGGAPLG 

CTGCGGCTGCCCCGGGAG ACCGACGAAGAG CCCGAGGAGCCC GGCCGGAGGGGC AGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VEMVDN LRGKSGQGYYVEMT 

GTGGGC AGCCCCCCGCAG ACGCTCAACATC CTGGTGGATACA GGCAGCAGTAAC TTTGCA 
VGSPPOTLNILVDTGSSNFA 

GTGGGTGCrGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 
VGAAPH PFLHRYYQ ROLSST 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGC ACCGACCTGGTA AGCATCCCCCAT GGCCCCAACGTC ACTGTGCGTGCC AACATT 
LGTDLV SIPHGPNVTVRANI 

GCTGCC ATCACTGAATCA GACAAGTTCTTC ATCAACGGCTCC AACTGGGAAGGC ATCCTG 
AAITESDKFFINGSNWEGIL 

GGGCTGGCCTATGCTGAG ATTGCCAGGCTT TGTGGTGCTGGC TTCCCCCTCAAC CAGTCT 
GLAYAE I. ARLCGAGFPLNOS 

GAAGTGCTGGCCTCTGTCGGAGGGAGCATG ATCATTGGAGGT ATCGACCACTCG CTGTAC 
EVLASVGGSMIIGGIDHSLY 

ACAGGCAGTCTCTGGTAT ACACCCATCCGG CGGGAGTGGTATTATGAGGTGATC ATTGTG 
TGSLWYTPIRREWYYEVI IV 

CGGGTGGAGATCAATGGA CAGGATCTGAAA ATGGACTGCAAG GAGTACAACTAT GACAAG 
RVEINGODLKMDCKEYNYDK 

AGCATTGTGGACAGTGGC ACCACCAACCTT CGTTTGCCCAAG AAAGTGTTTGAA GCTGCA 
SIVDSGTTNLRLPKKVFEAA 

GTCAAATCCATCAAGGCAGCCTCCTCCACG GAGAAGTTCCCT GATGGTTTCTGG CTAGGA 
VKSIKAASSTEKFPDGFWLG 

GAGCAGCTGGTGTGCTGG CAAGCAGGCACC ACCCCTTGGAAC ATTTTCCCAGTC ATCTCA 
EOLV.CW QAGTTPWN I FPV I S 

CTCTACCTAATGGGTGAG GTTACCAACCAG TCCTTCCGCATC ACCATCCTTCCG CAGCAA 
LYLMGEVTNQSFRI TI LPQQ 

TACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTACAAGTTTGCCATC 
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YLRPVEDVAT30DDCYKFAI 

TCACAGTCATCCACGGGC ACTGTTATGGGA GCTGTTATCATG GAGGGCTTCTAC GTTGTC 
SQSSTGTVMGAVIMEGFYVV 

TTTGAT CGGGCCCGAAAA CGAATTGGCTTT GCTGTCAGCGCT TGCCATGTGCAC GATGAG 
FDRARKRIGFAVSACHVH DE 

TTCAGG ACGGCAGCGGTG GAAGGCCCTTTT GTCACCTTGGAC ATGGAAGACTGT GGCTAC 
FRTAAVEGPFVTLDMEDCGY 

AACATT CCACAGACAGAT GAGTCAACCCTC ATGACCATAGCC TATGTCATGGCT GCCATC 
NI PQTDESTLMTIAYVMAAI 

TGCGCC CTCTTCATGCTG CCACTCTGCCTC ATGGTGTGTCAG TGGCGCTGCCTC CGCTGC 
CALFML PLCLMVCQWRCL RC 

CTGCGC CAGCAGCATGAT GACTTTGCTGAT GACATCTCCCTG CTGAAGTGAGGA GGCCCA 
LRQQHD DFADDISLLK 

TGGGCAGAAGAtAGAGAT TCCCCTGGACCA CACCTCCGTGGT TCACTTTGGTCA CAAGTA 
GGAGACACAGATGGCACC TGTGGCCAGAGC ACCTCAGGACCC TCCCCACCCACC AAATGC 
CTCTGCCITGATGGAGAA GGAAAAGGCTGG CAAGGTGGGTTC CAGGGACTGTAC CTGTAG 
GAAACAGAAAAGAGAAGA AAGAAGCACTCT GCTGGCGGGAAT ACTCTTGGTCAC CTCAAA 
TTTAAGTCGGGAAATTCT GCTGCTTGAAAC TTCAGCCCTGAA CCTTTGTCCACC ATTCCT 
TTAAATTCTCCAACCCAA AGTATTCTTCTT TTCTTAGTTTCA GAAGTACTGGCA TCACAC 
GCAGGTTACCTTGGCGTG TGTCCCTGTGGT ACCCTGGCAGAG AAGAGACCAAGC TTGTTT 
CCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATTTGCTTTAGAGACAGG 
GACTGT ATAAACAAGCCT AACATTGGTGCA AAGATT6CCTCT TGAAAAAAAAAA AAA 
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FIGURE 3A 



ATGGCCCAAGCCCTGCCC TGGCTCCTGCTG TGGATGGGCGCG GGAGTGCTGCCT GCCCAC 
MA QALP WLLL WMGAGVLP AH 

GGCACCCAGCACGGCATC CGGCTGCCCCTG CGCAGCGGCCTG GGGGGCGCCCCC CTGGGG 
GTQHGI RLPLRSGLGGAP LG 

CTGCGG CTGCCCCGGGAG ACCGACGAAGAG CCCG AGGAGCCC GGCCGGAGGGGC AGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VEMVDNLRGKSGOGYYVEMT 

GTGGGC AGCCCCCCGCAG ACGCTCAACATC CTGGTGGATACA GGCAGCAGTAAC TTTGCA 
VGSPPQTLNI LVDTGSSN FA 

GTGGGTGCTGCCCCCCAC CCCTTCCTGCAT CGCTACTACCAG AGGCAGCTGTCC AGCACA 
VGAAPH PFLH RYYORQLS ST 

TACCGGGACCTCCGGAAG GGTGTGTATGTG CCCTACACCCAG GGCAAGTGGGAA GGGGAG 
YRDLRKGVYV PYTQGKWE GE 

CTGGGC ACCGACCTGGTA AGCATCCCCCAT GGCCCCAACGTC ACTGTGCGTGCC AACATT 
LGTDLVSIPHGPNVTVRANI 

GCTGCC ATCACTGAATCA GACAAGTTCTTC ATCAACGGCTCC AACTGGG AAGGC ATCCTG 
AAITESDKFFINGSNWEG IL 

GGGCTGGCCTATGCTGAG ATTGCCAGGCCT GACGACTCCCTG GAGCCTTTCTTT GACTCT 
GLAYAB lARP DDSL EPFF DS 

CTGGTA AAGCAGACCCAC GTTCCCAACCTC TTCTCCCTGCAG CTTTGTGGTGCT GGCTTC 
LVKQTHVPNL FSLQLCGAGF 

CCCCTC AACCAGTCTGAA GTGCTGGCCTCT GTCGGAGGGAGC ATGATCATTGGA GGTATC 
PLNQSEVLAS VGGSMIIGGI 

GACCACTCGCTGTACACAGGCAGTCTCTGG TATACACCCATC CGGCGGG AGTGG TATTAT 
DHSLYTGSLWYTPIRREWYY 

GAGGTC ATCATTGTGCGG GTGGAGATCAAT GGACAGGATCTG AAAATGGACTGC AAGGAG 
EVIIVRVEIN GQDLKMDC KE 

TACAACTATGACAAGAGC ATTGTGGACAGT GGCACCACCAAC CTTCGTTTGCCC AAGAAA 
YNYDKSIVDSGTTNLRLPKK 



GTGTTT6AAGCTGCAGTC AAATCCATCAAG GCAGCCTCCTCC ACGGAGAAGTTC CCTGAT 
VFEAAVKSIKAASSTEKF PD 
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GGTTTC TGGCTAGGAGAG CAGCTGGTGTGC TGGCAAGCAGGC ACCACCCCTTGG AACATT 
GFWLGEQLVCWQAGTTPWNI 

TTCCCAGTCATCTCACTC TACCTAATGGGT GAGGTTACCAAC CAGTCCTTCCGC ATCACC 
FPVISLYLMG EVTNQSFR I T 

ATCCTT CCGCAGCAATAC CTGCGGCCAGTG GAAGATGTGGCC ACGTCCCAAGAC GACTGT 
ILPQOYLRPVEDVATSODDC 

TACAAGTTTGCCATCTCA CAGTCATCCACG GGCACTGTTATG GGAGCTGTTATC ATGGAG 
YKFAISQSSTGTVMGAVIME 

GGCTTCTACGTTGTCTTT GATCGGGCCCGA AAACGAATTGGCTTTGCTGTCAGC GCTTGC 
GFYVVFDRAR KRIGFAVSAC 

CATGTG CACGATGAGTTC AGGACGGCAGCG GTGGAAGGCCCT TTTGTCACCTTG GACATG 
HVHDEFRTAAVEGPFVTLDM 

GAAGACTGTGGCTACAAC ATTCCACAGACA GATGAGTCAACC CTCATGACCATA GCCTAT 
EDCGYN IPQTDESTLMTI AY 

GTCATGGCTGCCATCTGCGCCCTCTTCATG CTGCCACTCTGC CTCATGGTGTGT CAGTGG 
VMAAI CALFM LPLCLMVCQW 

CGCTGCCTCCGCTGCCTG CGCCAGCAGCAT GATGACTTTGCT GATGACATCTCC CTGCTG 
RCLRCLRQOH DDFADDI S LL 

AAGTGAGGAGGCCCATGG GCAGAAGATAGAGATTCCCCTGGACCACACCTCCGT GGTTCA 
K 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGAC^ 
CCACCC ACCAAATGCCTC TGCCTTGATGGA GAAGGAAAAGGC TGGCAAGGTGGG TTCCAG 
GGACrGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 
CTTGGTCACCTCAAATTT AAGTCGGGAAAT TCTGCTGCTTGA AACTTCAGCCCT GAACCT 
TTGTCC ACCATTCCTTTA AATTCTCCAACC CAAAGTATTCTT CTTTTCTTAGTT TCAGAA 
GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 
AGACCA AGCTTGTTTCCC TGCTGGCCAAAG TCAGTAGGAGAG GATGCACAGTTT GCTATT 
TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCC TCTTGA 
ATT AAA AAAAAAAAAAAA AAAAAAAAAAAA 
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FIGURE 4 

ATGGCCCCAGCGCTGCA CTGGCTCCTGCT ATGGGTGGGCTC GGGAATGCTGCC TGCCCAG 
MA PALK WLLL WVGS GMLP AQ 
GGAACCCATCrrCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 
GTHLGI RLPLRSGLAGPPLG 
CTGAGGCTGCCCCGGGA GACTGACGAGGA ATCGGAGGAGCC TGGCCGGAGAGG CAGCTTT 
LRLPRETDEESEEPGRRGSF 
GTGGAGATGGTGGACAA CCTGAGGGGAAA GTCCGGCCAGGG CTACTATGTGGA GATGACC 
VEMVDN LRGK SGQG YYVE MT 
GTAGGCAGCCCCCCACA GACGCTCAACAT CCTGGTGGACAC GGGCAGTAGTAA CTTTGCA 
VGSPPCTLNI LVDTGSSN FA 
GTGGGGGCTGCCCCACA CCCTTTCCTGCA TCGCTACTACCA GAGGCAGCTGTC CAGCACA 
VGAAPH PFLHRYYQRQLSST 
TATCG AGACCTCCGAAA GGGTGTGTATGT GCCCTACACCCA GGGCAAGTGGG A GGGGGAA 
YRDLRKGVYVPYTQGKWEGE 
CTGGGCACCGACCTGGT GAGCATCCCTCA TGGCCCCAACGT CACTGTGCX3TGC CAACATT 
LGTDLVSIPHGPNVTVRANI 
GCTGCCATCACTGAATC GGACAAGTTCTT CATCAATGGTTC CAACTGGGAGGG CATCCTA 
AA I TES DKFF INGS NWEG IL 
GGGCTGGCCTATGCTG A GATTGCCAGGCC CGACGACTCTTT GGAGCCCTTCTT TGACTCC 
GLAYAE lARPDDSLEPFFDS 
CTGGTGAAGCAGACCCA CATTCCCAACAT CTTTTCCCTGCA GCTCTGTGGCGC TGGCTTC 
LVKQTH IPNI FSLQLCGAGF 
CCCCT GAACC AGACCG A GGCACTGGCCTC GGTGGGAGGGAG CATGATCATTGG TGGTATC 
PLNQTE ALAS VGGS MI IGGI 
GACCACTCGCTATACAC GGGCAGTCTCTG GTACACACCCAT CCGGCGGGAGTG GTATTAT 
DHSLYTGSLWYTPI RREWYY 
GAAGTGATCATTGTACG TGTGGAAATCAA TGGTCAAGATCT CAAGATGGACTG CAAGGAG 
EVI IVR VEINGQDL KMDCKE 
TACAACTACGACAAGAG CATTGTGGACAG TGGGACCACCAA CCTTCGCTTGCC CAAGAAA 
YNYDKS IVDS GTTN LRLP KK 
GTATTTGAAGCTGCCGT CAAGTCCATCAA GGCAGCCTCCTC GACGGA6AAGTT CCCGGAT 
VFEAAVKSIKAASSTEKF PD 
GGCTTTTGGCTAGGGGAGO^GCTGGTGTGCrGGCAAGCAGGCACGACCCCTTGGAA^ 
GFWLGEOLVCWQAGTTPWNI 
TTCCCAGTCATTTCACT TTACCTCATGGG TGAAGTCACCAA TCAGTCCTTCCG CATCACC 
FPVISL YLMG EVTN QSFR IT 
ATCCTTCCTCAGCAATA CCTACGGCCGGT GGAGGACGTGGC CACGTCCCAAGA C6ACTGT 
ILPOOY LRPVEDVATSODDC 
TACAAGTTCGCTGTCTC ACAGTCATCCAC GGGCACTGTTAT GGGAGCCGTCAT CATGGAA 
YKFAVS QSST GTVMGAVI ME 
GGTTTCTATGTCGTCTT CGATCGAGCCCG AAAGCGAATTGG CTTTGCTGTCAG CGCTTGC 
GFYVVF DRAR KRIG FAVSAC 
CATGTGCACGATGAGTT CAGGACGGCGGC AGTGGAAGGTCC GTTTGTTACGGC AGACATG 
HVHDEF RTAA VEGP FVTA DM 
GAAGACTGTGGCTACAA CATTCCCCAGAC AGATGAGTCAAC ACTTATGACCAT AGCCTAT 
EDCGYN IPOTDESTLMTI AY 
GTCATGGCGGCCATCTG CGCCCTCTTCAT GTTGCCACTCTG CCTCATGGTATG TCAGTGG 
VMAAIC ALFM LPLCLMVC QW 
CGCTGCCTGCGTTGCCT GCGCCACCAGCA CGATGACTTTGC TGATGACATCTC CCTGCTC 
RCLRCLRHQH DDFADDISLL 
AAGTA AGGAGGCTCGTG GGCAGATGATGG AGACGCCCCTGG ACCACATCTGGG TGGTTCC 
K 

CTTTGGTCACATGAGTT GGAGCTATGGAT GGTACCTGTGGC CAGAGCACCTCA GGACCCT 
CACCAACCTGCCAATGC TTCTGGCGTGAC AGAACAGAGAAA TCAGGCAAGCTG GATTACA 
GGGCTTGCACCTGTAGG ACACAGGAGAGG GAAGGAAGCAGC GTTCTGGTGGCA GGAATAT 
CCTTAGGCACCACAAAC TTGAGTTGGAAA TTTTGCTGCTTG AAGCTTCAGCCC TGACCCT 
CTGCCCAGCATCCTTTA GAGTCTCCAACC TAAAGTATTCTT TATGTCCTTCCA GAAGTAC 
TGGCGTCATACTCAGGC TACCCGGCATGT GTCCCTGTGGTA CCCTGGCAGAGA AAGGGCC 
AATCT CATTCCCTGCTG GCCAAAGTCAGC AGAAGAAGGTGA AGTTTGCCAGTT GCTTTAG 
TGATAGGGACTGCAGAC TCAAGCCTACAC TGGTACAAAGAC TGCGTCTTGAGA TAAACAA 
GAA 
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1 MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEE 50 

II II llil|.|.hllMI llllllllll I lllllllllllll 
1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLPRETDEE 50 

51 PEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 100 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIi 
51 SEBPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 100 

101 VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 150 

IIMMIMMIIIIIIIIIIIMIIIIIMIIIIIIIilllllMIIII 

101 VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 150 

151 GPNVTVRANIAAITESDKFFINGSNWEGIIiGLAYAEIARPDDSLEPFFDS 200 

lllliMllllllllllllllllllllllllllllllllliMillllll 
151 GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 200 

201 LVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLW 250 

MlllhlhlllllllllMIIM IIIIIIIMIIIIIIMIIIIII 

201 LVKQTHIPNIFSLQLCGAGFPLNQTBALASVGGSMIIGGIDHSLYTGSLW 250 

251 YTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 300 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIillllilllillllllll 
251 YTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 300 

301 VFEAAVKSIKAASSTBKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 350 

llllllllilllllllillMIIIIIIIIIIIMIIIMIIIIIIMIII 

301 VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 350 
351 EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 400 

Mil Mill II I III Mill llllllllll! II hlllllllllll II II 

351 EVTNQSPRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIME 400 
401 GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 450 

IIIIMIIIIIIIIMIIIIIIIMIIIIIIIMIII IIMIIIIMM 

401 GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQT 450 
451 DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 

IIIIIMIIIMIIIillllillllllllllMIIM llllllllllli 

451 DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 



501 K 501 
I 

501 K 501 
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ATGGCTAGC ATGACTGGTGGA CAGCAAATGGGT CGCGGATCCACC CAGCACGGCATC CGG 
MAS MTGG QQMG RGS T Q H G I R 

CTGCCCCTG CGCAGCGGCCTG GGGGGCGCCCCC CTGGGGCTGCGG CTGCCCCGGGAG ACC 
LPL RSGLGGAP LGLR LPRE T 

GACGAAGAG CCCGAGGAGCCC GGCCGGAGGGGC AGCTTTGTGGAG ATGGTGGACAAC CTG 
DEE P EE? GRRG SFVE MVDN L 

AGGGGCAAG TCGGGGCAGGGC TACTACGTGGAG ATGACCGTGGGC AGCCCCCCGCAG ACQ 
RGK SGOG YYVE MTVG S P PQ T 

CTCAACATC CTGGTGGATACA GGCAGCAGTAAC TTTGCAGTGGGT GCTGCCCCCCAC CCC 
LNI LVDTGSSNFAVGAAPH P 

TTCCTGCAT CGCTACTACCAG AGGCAGCTGTCC AGCACATACCGG GACCTCCGGAAG GGC 
FLH RYYO RQliS STYR DLRK G 

GTGTATGTG CCCTACACCCAG GGCAAGTGGGAAGGGGAGCTGGGC ACCGACCTGGTA AGC 
VYV PYTO GKWEGELG TDLV S 

ATCCCCCAT GGCCCCAACGTC ACTGTGCGTGCC AACATTGCTGCC ATCACTGAATCA GAC 
IPH GPNV TVRANIAA ITES D 

AAGTTCTTC ATCAACGGCTCC AACTGGGAAGGC ATCCTGGGGCTG GCCTATGCTGAG ATT 
KFF INGSNWEG ILGLAYAE I 

GCCAGGCCTGACGACTCCCTG GAGCCTTTCTTT GACTCTCTGGTA AAGCAGACCCAC GTT 
ARP DDSL E PFF DSLV KQTH V 

CCCAACCTC TTCTCCCTGCAG CTTTGTGGTGCT GGCTTCCCCCTC AACCAGTCTGAA GTG 
PNL FSLO LCGAGFPLNQSE V 

CTGGCCTCT GTCGGAGGGAGC ATGATCATTGGA GGTATCGACCAC TCGCTGTACACA GGC 
LAS VGGS MIIGGIDH SLYTG 

AGTCTCTGG TATACACCCATC CGGCGGGAGTGG TATTATGAGGTC ATCATTGTGCGG GTG 
SLWYTPI RREWYYEVIIVRV 

GAGATCAAT GGACAGGATCTG AAAATGGACTGC AAGGAGTACAAC TATGACAAGAGC ATT 
BINGQDLKMDCKEYNYDKS I 

GTGGACAGT GGCACCACCAAC CTTCGTTTGCCC AAGAAAGTGTTT GAAGCTGCAGTC AAA 
VDS GTTN LRLP KKVF EAAV K 

TCCATCAAG GCAGCCTCCTCC ACGGAGAAGTTC CCTGATGGTTTC TGGCTAGGAGAG CAG 
SIKAASS TEKF PDGFWLGE 0 

CTGGTGTGC TGGCAAGCAGGC ACCACCCCTTGG AACATTTTCCCA GTCATCTCACTC TAG 
LVCWQAGTTPWNIFPVISL Y 

CTAATGGGTGAGGTTACCAAC CAGTCCTTCCGC ATCACCATCCTT CCGCAGCAATAC CTG 
LMGEVTNQSFR ITILPQQY L 



CGGCCAGTGG AAGATGTGGCCA CGTCCCAAGACG ACTGTTACAAGT TTGCCATCTCAC AG 
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RPVEDVATSODDCYKFAISO 

TCATCCACGG GCACTGTTATGG GAGCTGTTATCA TGGAGGGCTTCT ACGTTGTCTTTG AT 
SSTGTVMGAVIMEGFYVVFD 

CGGGCCCGAA AACGAATTGGCT TTGCTGTCAGCG CTTGCCATGTGC ACGATGAGTTCA GG 
RARK RIGF AVSACHVH DEFR 

ACGGCAGCGG TGGAAGGCCCTT TTGTCACCTTGG ACATGGAAGACT GTGGCTACAACA TT 
TAAVEGPFVTLDMEDCGYNI 

CCACAGACAG ATG AGTCATGA 
P 0 T D E S * 
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FIGURE 7A 

ATGGCTAGC ATGACTGGTGGA CAGCAAATGGGT CGCGGATCGATG ACTATCTCTGAC TCT 
MAS MTGGQQMGRGSMTISD S 

CCGCGTGAA CAGGACGGATCC ACCCAGCACGGC ATCCGGCTGCCC CTGCGCAGCGGC CTG 
PRE Q D G S TOHGIRLPLRSGL 

GGGGGCGCC CCCCTGGGGCTG CGGCTGCCCCGG GAGACCGACGAA GAGCCCGAGGAG CCC 
GGA PLGLRLPRETDE EPEE P 

GGCCGGAGG GGCAGCTTTGTG GAGATGGTGGAC AACCTGAGGGGC AAGTCGGGGCAG GGC 
GRR GSFVEMVDNLRG KSGQG 

TACTACGTG GAGATGACCGTG GGCAGCCCCCCG CAGACGCTCAAC ATCCTGGTGGAT ACA 
YYV EMTVGSPPQTLN ILVDT 

GGCAGCAGT AACTTTGCAGTG GGTGCTGCCCCC CACCCCTTCCTG CATCGCTACTAC CAG 
GSS NFAV GAAP H P FL HRYY Q 

AGGCAGCTG TCCAGCACATAC CGGGACCTCCGG AAGGGCGTGTAT GTGCCCTACACC CAG 
RQL SSTY RDLR KGVY VPYTQ 

GGCAAGTGG GAAGGGGAGCTG GGCACCGACCTG GTAAGCATCCCC CATGGCCCCAAC GTC 
GKW EGELGTDLVSIP HGPNV 

ACTGTGCGT GCCAACATTGCT GCCATCACTGAA TCAGACAAGTTC TTCATCAACGGC TCC 
TVR ANIA AITE SDKF FING S 

AACTGGGAA GGCATCCTGGGG CTGGCCTATGCT GAGATTGCCAGG CCTGACGACTCC CTG 
NWE GILGLAYA EIAR PDDS L 

GAGCCTTTC TTTGACTCTCTG GTAAAGCAGACC CACGTTCCCAAC CTCTTCTCCCTG CAG 
EPF FDSLVKQT HVPN LFSLQ 

CTTTGTGGT GCTGGCTTCCCC CTCAACCAGTCT GAAGTGCTGGCC TCTGTCGGAGGG AGO 
LCG AG FP LNQS EVLA SVGG S 

ATGATCATT GGAGGTATCGAC CACTCGCTGTAC ACAGGCAGTCTC TGGTATACACCC ATC 
Mil GGIDHSLYTGSLWYTP I 

CGGCGGGAG TGGTATTATGAG GTCATCATTGTG CGGGTGGAGATC AATGGACAGGAT CTG 
RREWYYEVIIVRVEI NGQDL 

AAAATGGAC TGCAAGGAGTAC AACTAT6ACAAG AGCATTGTGGAC AGTGGCACCACC AAC 
KMDCKEYNYDKSIVD SGTTN 

CTTCGTTTG CCCAAGAAAGTG TTTGAAGCTGCA GTCAAATCCATC AAGGCAGCCTCC TCC 
LRL PKKV FEAAVKSI KAAS S 

ACGGAGAAG TTCCCTGATGGT TTCTGGCTAGGA GAGCAGCTGGTG TGCTGGCAAGCAGGC 
TEK FPDG FWLGEQLVCWQAG 

ACCACCCCTT GGAACATTTTCC CAGTCATCTCAC TCTACCTAATGG GTGAGGTTACCA AC 
TTPWNIFPVISLYLMGEVTN 
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FIGURE 7B 



CAGTCCTTCC GCATCACCATCC TTCCGCAGCAAT ACCTGCGGCCAG TGGAAGATGTGG CC 
OSFR ITILPQQYLRPVEDVA 

ACGTCCCAAG ACGACTGTTACA AGTTTGCCATCT CACAGTCATCCA CGGGCACTGTTA TG 
TSQDDCYKFAISOSSTGTVM 

GGAGCTGTTA TCATGGAGGGCT TCTACGTTGTCT TTGATCGGGCCC GAAAACGAATTG GC 
GAVI MEGFYVVFDRARKRIG 

TTTGCTGTCA GCGCTTGCCATG TGCACGATGAGT TCAGGACGGCAG CGGTGGAAGGCC CT 
FAVS ACHV HDEF RTAAVEGP 

TTTGTCACCr TGGACATGGAAG ACTGTGGCTACA ACATTCCACAGA CAGATGAGTCAT GA 
FVTL DMEDCGYN IPQTDES* 
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ATGACTCAGCATGG TATTCGTCTGCC ACTGCGTAGCGG TCTGGGTGGTGC TCCACTGGGT 
MTOHGIRLPLRSGLGGAPLG 

CTGCGTCTGCCCCG GGAGACCGACGA AGAGCCCGAGGA GCCCGGCCGGAG GGGCAGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGA CAACCTGAGGGG c;iAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VEMVDNLRGKSGOGYYVBMT 

GTGGGCAGCCCCCC GCAGACGCTCAA CATCCTGGTGGA TACAGGCAGCAG TAACTTTGCA 
VGSPPQTLNILVDTGSSNFA 

GTGGGTGCTGCCCC CCACCCCTTCCT GCATCGCTACTA CCAGAGGCAGCT GTCCAGCACA 
VGAAPHPFLHRYYOROLSST 

TA CCGGGACCTCCG GAAGGGCGTGTA TGTGCCCTACAC CCAGGGCAAGTG GGAAGGGGAG 
YRDLRKGVYVPYT QGKWEGE 

CTGGGCACCGACCT GGTAAGCATCCC CCATGGCCCCAA CGTCACTGTGCG TGCCAACATT 
LGTDLVSIPHGPNVTVRANI 

GC TGCCATCACTGA ATCAGACAAGTT CTTCATCAACGG CTCCAACTGGGA AGGCATCCTG 
AAITESDKFFINGSNWEGIL 

GG GCTGGCCTATGC TGAGATTGCCAG GCCTGACGACTC CCTGGAGCCTTT CTTTGACTCT 
GLAYAEIARPDDSLEPFFDS 

CTGGTAAAGCAGAC CCACGTTCCCAA CCTCTTCTCCCT GCAGCTTTGTGG TGCTGGCTTC 
LVKQTHVPNLFSLQLCGAGF 

CCCCTCAACCAGTC TGAAGTGCTGGC CTCTGTCGGAGG GAGCATGATCAT TGGAGGTATC 
PLNQSEVLASVGGSMIIGGI 

GACCACTCGCTGTA CACAGGCAGTCT CTGGTATACACC CATCCGGCGGGA GTGGTATTAT 
DHSLYTGSLWYTPIRREWYY 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 
EVIIVRVEINGQDLKMDCKE 

TA CAACTATGACAA GAGCATTGTGGA CAGTGGCACCAC CAACCTTCGTTT GCCCAAGAAA 
YNYDKSIVDSGTTNLRLPKK 

GTGTTTGAAGCTGC AGTCAAATCCAT CAAGGCAGCCTC CTCCACGGAGAA GTTCCCTGAT 
VFEAAVKSIK AASSTEKFPD 

GG TTTCTGGCTAGG AGAGCAGCTGGT GTGCTGGCAAGC AGGCACCACCCC TTGGAACATT 
GFWLGEQLVCWQAGTTPWNI 

TTCCCAGTCATCTC ACTCTACCTAAT GGGTGAGGTTAC CAACCAGTCCTT TCGCATCACC 
FPVISLYLMGEVTNQSFRIT 

ATCCTTCCGCAGCA ATACCTGCGGCC AGTGGAAGATGT GGCCACGTCCCA AGACGACTGT 
ILPQOYLRPVEDVATSQDDC 
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FIGURE SB 



T A C AAGTTTGCCAT CTCACAGTCATC CACGGGCACTGT TATGGGAGCTGT TATCATGG AG 
YKFAI SOSSTGTVMGAVIME 

GGCTTCTACGTTGT CTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGT CAGCGCTTGC 
GFYVV FDRARKRI GFAVSAC 

CATTAG 
H * 
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FIGURE 9 
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FIGURE 10 
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FIGURE 11 

MAQALPWLLLWMGAGVIiPAHG TQHGIRLPLRSGLGGAPLGLRLPRETPEE 
PEEPGRRGSFVEMVDNLRGKSGOGYYVEMTVGSPPQTLNILVDTGSSNFA 
VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 
GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 
LVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLW 
YTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 
VFEAAVKSI KAASSTEKFPDGFWLGEQLVCWQAGTTPWNI FPVI SLYLMG 
EVTNQS FR I T I LPQQYLR PVEDVATS QDDC YKFA I SQS S TGTVMGAV I ME 
GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 

DES 
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MAQALPWLLLWMGAGVLPAHG TQHGIRLPLRSGLGGAPLGLRLPRETDEE 
PEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 
VGAJVPHPFLHRYYQRQLSSTYRDLRKGVYVPYTOGKWEGELGTDLVSIPH 
GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 
LVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLW 
YTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 
VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 
EVTNQS FR I T I LPQQYLRPVEDVATSQDDCYKFAI SQS STGTVMGAVI ME 
GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 
DESHHHHHH 
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SEQUENCE LISTING 

<110> Beinkowski et al. 

cl20> ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR. AND USES 







<130> 


28341/6280M 


<140> 
<141> 




<150> 
<151> 


1999-10-13 


<150> 
<151> 


60/155,493 
1999-09-23 


<150> 
<151> 


09/404, 133 
1999-09-23 


<150> 
<151> 


PCT/US99/20881 
1999-09-23 


<150> 
<151> 


60/101,594 
1998-09-24 


<160> 


73 


<170> 


Patentin Ver. 


<210> 
<211> 
<212> 
<213> 


1 

1804 
DNA 

Homo sapiens 


<400> 


1 



atgggcgcac tggcccgggc gctgctgctg cctctgctgg cccagtggct cctgcgcgcc 60 
gccccggagc tggcccccgc gcccttcacg ctgcccctcc gggtggccgc ggccacgaac 120 
cgcgtagttg cgcccacccc gggacccggg acccctgccg agcgccacgc cgacggcttg 180 
gcgctcgccc tggagcctgc cctggcgtcc cccgcgggcg ccgccaactt cttggccatg 240 
gtagacaacc tgcaggggga ctctggccgc ggctactacc tggagatgct gatcgggacc 300 
cccccgcaga agctacagat tctcgttgac actggaagca gtaactttgc cgtggcagga 360 
accccgcact cctacataga cacgtacttt gacacagaga ggtctagcac ataccgctcc 420 
aagggctttg acgtcacagt gaagtacaca caaggaagct ggacgggctt cgttggggaa 480 
gacctcgtca ccatccccaa aggcttcaat acttcttttc ttgtcaacat tgccactatt 540 
tttgaatcag agaatttctt tttgcctggg attaaatgga atggaatact tggcctagct 600 
tatgccacac ttgccaagcc atcaagttct ctggagacct tcttcgactc cctggtgaca 660 
caagcaaaca tccccaacgt tttctccatg cagatgtgcg gagccggctt gcccgctgct 720 
ggatctggga ccaacggagg tagtcttgtc ttgggtggaa ttgaaccaag tttgtataaa 780 
ggagacatct ggtatacccc tattaaggaa gagtggtact accagataga aattctgaaa 840 
ttggaaattg gaggccaaag cctcaatctg gactgcagag agtataacgc agacaaggcc 900 
atcgtggaca gtggcaccac gctgctgcgc ctgccccaga aggtgtttga tgcggtggtg 960 
gaagctgtgg cccgcgcatc tctgattcca gaattctctg atggtttctg gactgggtcc 1020 
cagctggcgt gctggacgaa ttcggaaaca ccttggtcct acttccctaa aatctccatc 1080 
tacctgagag atgagaactc cagcaggtca tcccgtatca caatcctgcc tcagctttac 1140 
attcagccca tgatgggggc cggcccgaat tatgaatgtt accgattcgg cattccccca 1200 
tccacaaatg cgctggtgat cggtgccacg gtgatggagg gcttctacgt catcttcgac 1260 
agagcccaga agagggtggg cttcgcagcg agcccctgtg cagaaattgc aggtgctgca 1320 
gtgtctgaaa tttccgggcc tttctcaaca gaggatgtag ccagcaactg tgtccccgct 1380 
cagtctctga gcgagcccat tttgtggatt gtgtcctatg cgctcatgag cgtccgtgga 1440 
gccatcctcc ttgtcttaat cgtcctgctg ctgctgccgt tccggtgtca gcgtcgcccc 1500 
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cgtgaccctg aggtcgtcaa 
gccaggcctg acctcaagca 
agcagccggg atcgatggtg 
gctcccagat gccctctaga 
ctccctactt ccaagaaaaa 
aaaa 

<210> 2 
<211> 518 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Gly Ala Leu Ala Arg Ala Leu Leu Leu Pro Leu Leu Ala Gin Trp 

15 10 15 

Leu Leu Arg Ala Ala Pro Glu Leu Ala Pro Ala Pro Phe Thr Leu Pro 
20 25 30 

Leu Arg Val Ala Ala Ala Thr Asn Arg Val Val Ala Pro Thr Pro Gly 
35 40 45 

Pro Gly Thr Pro Ala Glu Arg His Ala Asp Gly Leu Ala Leu Ala Leu 
50 55 60 

Glu Pro Ala Leu Ala Ser Pro Ala Gly Ala Ala Asn Phe Leu Ala Met 
65 70 75 80 

Val Asp Asn Leu Gin Gly Asp Ser Gly Arg Gly Tyr Tyr Leu Glu Met 
85 90 95 

Leu He Gly Thr Pro Pro Gin Lys Leu Gin He Leu Val Asp Thr Gly 
100 105 110 

Ser Ser Asn Phe Ala Val Ala Gly Thr Pro His Ser Tyr He Asp Thr 

115 120 125 

Tyr Phe Asp Thr Glu Arg Ser Ser Thr Tyr Arg Ser Lys Gly Phe Asp 
130 135 140 

Val Thr Val Lys Tyr Thr Gin Gly Ser Trp Thr Gly Phe Val Gly Glu 
145 150 155 160 

Asp Leu Val Thr He Pro Lys Gly Phe Asn Thr Ser Phe Leu Val Asn 
165 170 175 



tgatgagtcc tctctggtca gacatcgctg gaaatgaata 1560 
accacgaact cagctattaa gaaaatcaca ttcccagggc 1620 
gcgctttctc ctgtgcccac ccgtcttcaa tctctgttct 1680 
ttcactgtct tttgattctt gattttcaag ctttcaaatc 1740 
taattaaaaa aaaaacttca ttctaaacca aaaaaaaaaa 1800 

1804 



He Ala Thr He Phe Glu Ser Glu 

180 

Trp Asn Gly He Leu Gly Leu Ala 
195 200 

Ser Ser Leu Glu Thr Phe Phe Asp 
210 215 

Pro Asn Val Phe Ser Met Gin Met 
225 230 

Gly Ser Gly Thr Asn Gly Gly Ser 
245 



Asn Phe Phe Leu Pro Gly He Lys 
185 190 

Tyr Ala Thr Leu Ala Lys Pro Ser 
205 

Ser Leu Val Thr Gin Ala Asn He 

220 

Cys Gly Ala Gly Leu Pro Val Ala 
235 240 

Leu Val Leu Gly Gly He Glu Pro 
250 255 
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Ser Leu Tyr Lys Gly Asp lie Trp Tyr Thr Pro lie Lys Glu 01 u Trp 
260 265 270 

Tyr Tyr Gin He Glu He Leu Lys Leu Glu He Gly Gly Gin Ser Leu 
275 280 285 

Asn Leu Asp Cys Arg Glu Tyr Asn Ala Asp Lys Ala He Val Asp Ser 
290 295 300 

Gly Thr Thr Leu Leu Arg Leu Pro Gin Lys Val Phe Asp Ala Val Val 
305 310 315 320 

Glu Ala Val Ala Arg Ala Ser Leu He Pro Glu Phe Ser Asp Gly Phe 
325 330 335 

Trp Thr Gly Ser Gin Leu Ala Cys Trp Thr Asn Ser Glu Thr Pro Trp 
340 345 350 

Ser Tyr Phe Pro Lys He Ser He Tyr Leu Arg Asp Glu Asn Ser Ser 
355 360 365 

Arg Ser Phe Arg He Thr He Leu Pro Gin Leu Tyr He Gin Pro Met 
370 375 380 

Met Gly Ala Gly Leu Asn Tyr Glu Cys Tyr Arg Phe Gly He Ser Pro 
385 390 395 400 

Ser Thr Asn Ala Leu Val He Gly Ala Thr Val Met Glu Gly Phe Tyr 
405 410 415 

Val He Phe Asp Arg Ala Gin Lys Arg Val Gly Phe Ala Ala Ser Pro 
420 425 430 

Cys Ala Glu He Ala Gly Ala Ala Val Ser Glu He Ser Gly Pro Phe 
435 440 445 

Ser Thr Glu Asp Val Ala Ser Asn Cys Val Pro Ala Gin Ser Leu Ser 
450 455 460 



Glu Pro He Leu Trp He Val Ser Tyr Ala Leu Met Ser Val Cys Gly 
465 470 475 480 

Ala He Leu Leu Val Leu He Val Leu Leu Leu Leu Pro Phe Arg Cys 
485 490 495 

Gin Arg Arg Pro Arg Asp Pro Glu Val Val Asn Asp Glu Ser Ser Leu 
500 505 510 

Val Arg His Arg Trp Lys 
515 



<210> 3 

<211> 2070 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 
ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 
ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 
gtggagatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagatgacc 240 



wo 01/49097 



PCT/IBOl/00797 



-4- 

gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 300 
gtgggtgctg ccccccaccc cttcctgcat cgctactacc agaggcagct gtccagcaca 360 
taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 
ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 480 
gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 54 0 
gggctggcct atgctgagac tgccaggcct gacgaccccc tggagccttt ctttgactct 600 
ctggtaaagc agacccacgt tcccaacctc ttctccctgc acctttgtgg tgctggcttc 660 
cccctcaacc agtctgaagt gctggcctct gtcggaggga gcatgatcat tggaggtatc 720 
gaccactcgc tgtacacagg cagtctctgg tatacaccca tccggcggga gtggtattat 780 
gaggtcatca ttgtgcgggt ggagatcaat ggacaggatc tgaaaatgga ctgcaaggag 840 
tacaactatg acaagagcat tgtggacagt ggcaccacca accttcgttt gcccaagaaa 900 
gtgtttgaag ctgcagtcaa atccatcaag gcagcctcct ccacggagaa gttccctgat 960 
ggtttctggc taggagagca gctggtgtgc tggcaagcag gcaccacccc ttggaacatt 1020 
ttcccagtca tctcactcta cctaatgggt gaggttacca accagtcctt ccgcatcacc 1080 
atccttccgc agcaatacct gcggccagtg gaagatgtgg ccacgtccca agacgactgt 1140 
tacaagtttg ccatctcaca gtcatccacg ggcactgtta tgggagctgt tatcatggag 1200 
ggcttctacg ttgtctttga tcgggcccga aaacgaattg gctttgctgt cagcgcttgc 1260 
catgtgcacg atgagttcag gacggcagcg gtggaaggcc cttttgtcac cttggacatg 1320 
gaagactgtg gctacaacat tccacagaca gatgagtcaa ccctcatgac catagcctat 1380 
gtcatggctg ccatctgcgc cctcttcatg ctgccactct gcctcatggt gtgtcagtgg 1440 
cgctgcctcc gctgcctgcg ccagcagcat gatgactttg ctgatgacat ctccctgctg 1500 
aagtgaggag gcccatgggc agaagataga gattcccctg gaccacacct ccgtggttca 1560 
ctttggtcac aagtaggaga cacagatggc acctgtggcc agagcacctc aggaccctcc 1620 
ccacccacca aatgcctctg ccttgatgga gaaggaaaag gctggcaagg tgggttccag 1680 
ggactgtacc tgtaggaaac agaaaagaga agaaagaagc actctgctgg cgggaatact 1740 
cttggtcacc tcaaatttaa gtcgggaaat tctgctgctt gaaacttcag ccctgaacct 1800 
ttgtccacca ttcctttaaa ttctccaacc caaagtattc ttrttttctt agtttcagaa 1860 
gtactggcat cacacgcagg ttaccttggc gtgtgtccct gtggtaccct ggcagagaag 1920 
agaccaagct tgtttccctg ctggccaaag tcagtaggag aggatgcaca gtttgctatt 1980 
tgctttagag acagggactg tataaacaag cctaacattg gtgcaaagat tgcctcttga 2040 
attaaaaaaa aaaaaaaaaa aaaaaaaaaa 207C 

<210> 4 

<211> 501 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala 01 y Val 

15 10 15 

Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser 
20 25 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 

85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 



Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 
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Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 

180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro 
195 200 205 

Asn Leu Phe Ser Leu His Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 

Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 

Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 

290 295 300 

Ala Val Lys Ser lie Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 



Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 



Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val Met Ala Ala 
450 455 460 
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Ile Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys Gin Trp 
465 470 475 480 

Arg Cys Leu Arg Cys Leu Arg Gin Gin His Asp Asp Phe Ala Asp Asp 
485 49C 495 

lie Ser Leu Leu Lys 
500 



<210> 5 

<211> 1977 

<212> DNA 

<213> Homo sapiens 

<400> 5 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 

ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 

ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagct::t 180 

gtggagatgg tggacaacct gaggggcaag tcggggcagg gccactacgt ggagatgacc 240 

gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 30C 

gtgggtgctg ccccccaccc cttcctgcat cgctactacc agaggcagct gtccagcaca 36 C 

taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 

ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 480 

gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcaccctg 540 

gggctggcct atgctgagat tgccaggctt tgtggtgctg gcttccccct caaccagtct 600 

gaagtgctgg cctctgtcgg agggagcatg atcattggag gtatcgacca ctcgctgtac 660 

acaggcagtc tctggtatac acccatccgg cgggagtggt attatgaggt gatcattgtg 720 

cgggtggaga tcaatggaca ggatctgaaa atggactgca aggagtacaa ctatgacaag 780 

agcattgtgg acagtggcac caccaacctt cgtttgccca agaaagtgtt tgaagctgca 840 

gtcaaatcca tcaaggcagc ctcctccacg gagaagttcc ctgatggttt ctggctagga 900 

gagcagctgg tgtgctggca agcaggcacc accccttgga acattttccc agtcatctca 960 

ctctacctaa tgggtgaggt taccaaccag tccttccgca tcaccatcct tccgcagcaa 1020 

tacctgcggc cagtggaaga tgtggccacg tcccaagacg actgttacaa gtttgccatc 1080 

tcacagtcat ccacgggcac tgttatggga gctgttatca tggagggctt ctacgttgtc 1140 

tttgatcggg cccgaaaacg aattggcttt gctgtcagcg cttgccatgt gcacgatgag 1200 

ttcaggacgg cagcggtgga aggccctttt gtcaccttgg acatggaaga ctgtggctac 1260 

aacattccac agacagatga gtcaaccctc atgaccatag cctatgtcat ggctgccatc 1320 

tgcgccctct tcatgctgcc actctgcctc atggtgtgtc agtggcgctg cctccgctgc 1380 

ctgcgccagc agcatgatga ctttgctgat gacatctccc tgctgaagtg aggaggccca 1440 

tgggcagaag atagagattc ccctggacca cacctccgtg gttcactttg gtcacaagta 1500 

ggagacacag atggcacctg tggccagagc acctcaggac cctccccacc caccaaatgc 1560 

ctctgccttg atggagaagg aaaaggctgg caaggtgggt tccagggact gtacctgtag 1620 

gaaacagaaa agagaagaaa gaagcactct gctggcggga atactcttgg tcacctcaaa 1680 

tttaagtcgg gaaattctgc tgcttgaaac ttcagccctg aacctttgtc caccattcct 1740 

ttaaattctc caacccaaag tattcttctt ttcttagttt cagaagtact ggcatcacac 1800 

gcaggttacc ttggcgtgtg tccctgtggt accctggcag agaagagacc aagcttgttt 1860 

ccctgctggc caaagtcagt aggagaggat gcacagtttg ctatttgctt tagagacagg 1920 

gactgtataa acaagcctaa cattggtgca aagattgcct cttgaaaaaa aaaaaaa 1977 

<210> 6 

<2H> 476 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Val 
15 10 15 



Leu Pro Ala His Gly Thr Gin His Gly He Arg Leu Pro Leu Arg Ser. 
20 25 30 
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Gly Leu Gly Gly Ala Fro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 

85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie 
145 150 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Leu Cys Gly 
180 185 190 

Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly 
195 200 205 

Ser Met He He Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu 
210 215 220 

Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val 
225 230 235 240 

Arg Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr 
245 250 255 

Asn Tyr Asp Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu 
260 265 270 

Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser 
275 280 285 

Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val 
290 295 300 

Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser 

305 310 315 320 

Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He 
325 330 335 

Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin 
340 345 350 

Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val 
355 360 365 
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Met Gly Ala Val lie Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala 

370 375 380 

Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu 
385 390 395 400 

Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu 
405 410 415 

Asp Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser Thr Leu Met Thr 
420 425 430 

lie Ala Tyr Val Met Ala Ala He Cys Ala Leu Phe Met Leu Pro Leu 
435 440 445 

Cys Leu Met Val Cys Gin Trp Arg Cys Leu Arg Cys Leu Arg Gin Gin 

450 455 460 

His Asp Asp Phe Ala Asp Asp He Ser Leu Leu Lys 
465 470 475 



<210> 7 

<211> 2043 

<212> DNA 

<213> Mus musculus 



<400> 7 

atggccccag cgctgcactg gctcctgcta tgggtgggct cgggaatgct gcctgcccag 60 
ggaacccatc tcggcatccg gctgcccctt cgcagcggcc tggcagggcc acccctgggc 120 
ctgaggctgc cccgggagac tgacgaggaa tcggaggagc ctggccggag aggcagcttt 180 
gtggagatgg tggacaacct gaggggaaag tccggccagg gctactatgt ggagatgacc 240 
gtaggcagcc ccccacagac gctcaacatc ctggtggaca cgggcagtag taactttgca 300 
gtgggggctg ccccacaccc tttcctgcat cgctactacc agaggcagct gtccagcaca 360 
tatcgagacc tccgaaaggg tgtgtatgtg ccctacaccc agggcaagtg ggagggggaa 420 
ctgggcaccg acctggtgag catccctcat ggccccaacg tcactgtgcg tgccaacatt 480 
gctgccatca ctgaatcgga caagttcttc atcaatggtt ccaactggga gggcatccta 54 0 
gggctggcct atgctgagat tgccaggccc gacgactctt tggagccctt ctttgactcc 600 
ctggtgaagc agacccacat tcccaacatc ttttccctgc agctctgtgg cgctggcttc 660 
cccctcaacc agaccgaggc actggcctcg gtgggaggga gcatgatcat tggtggtatc 720 
gaccactcgc tatacacggg cagtctctgg tacacaccca tccggcggga gtggtattat 780 
gaagtgatca ttgtacgtgt ggaaatcaat ggtcaagatc tcaagatgga ctgcaaggag 84 0 
tacaactacg acaagagcat tgtggacagt gggaccacca accttcgctt gcccaagaaa 900 
gtatttgaag ctgccgtcaa gtccatcaag gcagcctcct cgacggagaa gttcccggat 960 
ggcttttggc taggggagca gctggtgtgc tggcaagcag gcacgacccc ttggaacatt 1020 
ttcccagtca tttcacttta cctcatgggt gaagtcacca atcagtcctt ccgcatcacc 1080 
atccttcctc agcaatacct acggccggtg gaggacgtgg ccacgtccca agacgactgt 1140 
tacaagttcg ctgtctcaca gtcatccacg ggcactgtta tgggagccgt catcatggaa 1200 
ggtttctatg tcgtcttcga tcgagcccga aagcgaattg gctttgctgt cagcgcttgc 1260 
catgtgcacg atgagttcag gacggcggca gtggaaggtc cgtttgttac ggcagacatg 1320 
gaagactgtg gctacaacat tccccagaca gatgagtcaa cacttatgac catagcctat 1380 
gtcatggcgg ccatctgcgc cctcttcatg ttgccactct gcctcatggt atgtcagtgg 1440 
cgctgcctgc gttgcctgcg ccaccagcac gatgactttg ctgatgacat ctccctgctc 150C 
aagtaaggag gctcgtgggc agatgatgga gacgcccctg gaccacatct gggtggttcc 1560 
ctttggtcac atgagttgga gctatggatg gtacctgtgg ccagagcacc tcaggaccct 1620 
caccaacctg ccaatgcttc tggcgtgaca gaacagagaa atcaggcaag ctggattaca 1680 
gggcttgcac ctgtaggaca caggagaggg aaggaagcag cgttctggtg gcaggaatat 174 0 
ccttaggcac cacaaacttg agttggaaat tttgctgctt gaagcttcag ccctgaccct 1800 
ctgcccagca tcctttagag tctccaacct aaagtattct ttatgtcctt ccagaagtac 1S60 
tggcgtcata ctcaggctac ccggcatgtg tccctgtggt accctggcag agaaagggcc 1920 
aatctcattc cctgctggcc aaagtcagca gaagaaggtg aagtttgcca gttgctttag 1980 
tgatagggac tgcagactca agcctacact ggtacaaaga ctgcgtcttg agataaacaa 2040 
gaa 2043 
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<210> 8 

<211> 501 

<:212> PRT 

<213> Mus musculus 

<400> 8 

Met Ala Pro Ala Leu His Trp Leu Leu Leu Trp Val Gly Ser Gly Met 
15 10 15 

Leu Pro Ala Gin Gly Thr His Leu Gly lie Arg Leu Pro Leu Arc Ser 
20 25 30 

Gly Leu Ala Gly Pro Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Ser Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val Asp Thr Gly Ser 

85 90 S3 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His He Pro 
195 200 205 

Asn He Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 

Thr Glu Ala Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 

Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 

275 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 



wo 01/49097 



PCT/IBOl/00797 



-10- 

Ala Val Lys Ser lie Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys ?rp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

Val Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 

385 390 395 40C 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val Kis Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 

Gly Pro Phe Val Thr Ala Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 

Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val Met Ala Ala 
450 455 460 

He Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys Gin Trp 
465 470 475 480 

Arg Cys Leu Arg Cys Leu Arg His Gin His Asp Asp Phe Ala Asp Asp 
485 490 495 

He Ser Leu Leu Lys 
500 



<210> 9 
<211> 2088 
<212> DNA 

<213> Homo sapiens 
<400> 9 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
gagtttgtaa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 4 80 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 54 0 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
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gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 114 0 
acacacatgg ccagagtgga agccatgccc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctccc cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 174 0 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaagatgga tgcagaattc 1800 
cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaacagtaca catccattca tcatggtgtg 1980 
gtggaggttg acgccgctgt caccccagag gagcgccacc tgtccaagat gcagcagaac 2040 
ggctacgaaa atccaaccta caagttcttt gagcagatgc agaactag 2088 

<210> 10 
<211> 695 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 .30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 



Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 
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Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 



Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 

340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 



Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 
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Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asd Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 
580 585 590 



Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala lie He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn 
690 695 



<210> 11 

<211> 2088 

<212> DNA 

<213> Homo sapiens 

<400> 11 

atgctgcccg gtttggcact gctcctgctg 

cccactgatg gtaatgctgg cctgctggct 

ctgaacatgc acatgaatgt ccagaatggg 

acctgcattg ataccaagga aggcatcctg 

cagatcacca atgtggtaga agccaaccaa 

ggccgcaagc agtgcaagac ccatccccac 

gagtttgtaa gtgatgccct tctcgttcct 

atggatgttt gcgaaactca tcttcaccgg 

aagagtacca acttgcatga ctacggcatg 

ggggtagagt ttgtgtgttg cccaccggct 

gcggaggagg atgactcgga tgtccggtgg 

agtgaagaca aagtagtaga agtagcagag 

gaagccgatg atgacgagga cgatgaggat 

ccctacgaag aagccacaga gagaaccacc 

gagtctgtgg aagaggtggt tcgagttcct 

gacaagtatc tcgagacacc tggggatgag 

gagaggcttg aggccaagca ccgagagaga 

gcagaacgtc aagcaaagaa cttgcctaaa 

caggagaaag tggaatcttt ggaacaggaa 



gccgcctgga cggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgtgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct gcggaattga caagttccga 540 
gaagaaagtg acaatgtgga ttctgctgat 600 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgatgagg tagaggaaga ggctgaggaa 780 
agcattgcca ccaccaccac caccaccaca 840 
acaacagcag ccagtacccc tgatgccgtt 900 
aatgaacatg cccatttcca gaaagccaaa 960 
atgtcccagg tcatgagaga atgggaagag 1020 
gctgataaga aggcagttat ccagcatttc 1080 
gcagccaacg agagacagca gctggcggag 1140 
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acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 

tacatcaccg ctctgcaggc tgttcct.cct cggcctcgtc acgtgttcaa tatgctaaag 1260 

aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattc cgagcatgtg 1320 

cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 

gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 

gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 

gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 

tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 

gacgatctcc agccgtggca ttcttttggg gctgactctg toccagccaa cacagaaaac 1680 

gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 

tctgggttga caaatatcaa gacggaggag atctctgaag tgaatctgga tgcagaattc 1800 

cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 

ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 

atcgtcatca ccttggtgat gctgaagaag aaacagtaca catccattca tcatggtgtg 1980 

gtggaggttg acgccgctgt caccccagag gagcgccacc tgtccaagat gcagcagaac 2040 

ggctacgaaa atccaaccta caagttcttt gagcagatgc agaactag 2088 

<210> 12 
<211> 695 
<212> PRT 
<213> Homo sapiens 

<400> 12 

Met Leu Pro Gly Leu 
1 5 

Ala Leu Glu Val Pro 

20 

Gin He Ala Met Phe 
35 

Asn Gly Lys Trp Asp 
50 

Thr Lys Glu Gly He 
65 

Gin He Thr Asn Val 
85 

Trp Cys Lys Arg Gly 
100 

He Pro Tyr Arg Cys 
115 

Val Pro Asp Lys Cys 
130 

Glu Thr His Leu His 
145 



Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
10 15 

Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
25 30 

Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
40 45 

Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 

55 60 

Leu Gin Tyr Cys Gin Glu Val Tyr Pre Glu Leu 
70 75 80 

Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
90 95 

Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
105 110 

Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
120 125 

Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
135 140 



Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
150 155 160 



Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 
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Trp Trp Gly G3y Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asd Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 

260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 

290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 

370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 



Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Gla Pro Arg He Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 
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Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Asn Leu Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 

595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His T,eu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 



Phe Phe Glu Gin Met Gin Asn 
690 695 



<210> 13 

<211> 2088 

<212> DNA 

<213> Homo sapiens 

<400> 13 

atgctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
ctgaacatgc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcatcctg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttgtaa gtgatgccct tctcgttcct 
atggatgttt gcgaaactca tcttcactgg 
aagagtacca acttgcatga ctacggcatg 
9999tagagt ttgtgtgttg cccactggct 
gcggaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagttcct 
gacaagtatc tcgagacacc tggggatgag 
gagaggcttg aggccaagca ccgagagaga 
gcagaacgtc aagcaaagaa cttgcctaaa 
caggagaaag tggaatcttt ggaacaggaa 
acacacatgg ccagagtgga agccatgctc 
tacatcaccg ctctgcaggc tgttcctcct 
aagtatgtcc gcgcagaaca gaaggacaga 
cgcatggtgg atcccaagaa agccgctcag 
gtgatttatg agcgcatgaa tcagtctctc 
gaggagattc aggatgaagt tgatgagctg 
gtcttggcca acatgattag tgaaccaagg 



gccgcctgga cggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgtgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct gcggaattga caagttccga 54 0 
gaagaaagtg acaatgtgga ttctgctgat 600 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgatgagg tagaggaaga ggctgaggaa 780 
agcattgcca ccaccaccac caccaccaca 840 
acaacagcag ccagtacccc tgatgccgtt 900 
aatgaacatg cccatttcca gaaagccaaa 960 
atgtcccagg tcatgagaga atgggaagag 1020 
gctgataaga aggcagttat ccagcatttc 1080 
gcagccaacg agagacagca gctggtggag 114 0 
aatgaccgcc gccgcctggc cctggagaac 1200 
cggcctcgtc acgtgttcaa tatgctaaag 1260 
cagcacaccc taaagcattt cgagcatgtg 1320 
atccggtccc aggttatgac acacctccgt 1380 
tccctgctct acaacgtgcc tgcagtggcc 1440 
cttcagaaag agcaaaacta ttcagatgac 1500 
atcagttacg gaaacgatgc tctcatgcca 1560 
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tctttgaccg aaacgaaaac 
gacgatctcc agccgtggca 
gaagttgagc ctgttgatgc 
tctgggttga caaatatcaa 
cgacatgact caggatatga 
ggttcaaaca aaggtgcaat 
atcttcatca ccttggtgat 
gtggaggttg acgccgctgt 
ggccacgaaa atccaaccta 

<210> 14 

<211> 695 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala AJa Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Aso Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 ' 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 ilG 



caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
ttcttctggg gctgactctg tgccagccaa cacagaaaac 1680 
ccgccctgct gccgaccgag gactgaccac tcgaccaggt 174 0 
gacggaggag atctctgaag tgaagatgga tgcagaattc 1800 
agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 
catcggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
gctgaagaag aaacagtaca catccattca tcatggtgtg 1980 
caccccagag gagcgccacc tgtccaagat gcagcagaac 2040 
caagtccttt gagcagatgc agaactag 2088 



He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 
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Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 



Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 



Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 40C 



Tyr He Thr Ala Leu Gin Ala Val 

405 

Asn Met Leu Lys Lys Tyr Val Arg 
420 

Thr Leu Lys His Phe Glu His Val 
435 440 

Ala Gin He Arg Ser Gin Val Met 
450 455 

Arg Met Asn Gin Ser Leu Ser Leu 
465 470 

Glu Glu He Gin Asp Glu Val Asp 
485 

Tyr Ser Asp Asp Val Leu Ala Asn 
500 



Pro Pro Arg Pro Arg His Val Phe 
410 415 

Ala Glu Gin Lys Asp Arg Gin His 
425 430 

Arg Met Val Asp Pro Lys Lys Ala 
445 

Thr His Leu Arg Val He Tyr Glu 
460 

Leu Tyr Asn Val Pro Ala Val Ala 
475 480 

Glu Leu Leu Gin Lys Glu Gin Asn 
490 495 

Met He Ser Glu Pro Arg He Ser 
505 510 



Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 



Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 
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Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 
580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala lie He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Phe He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 



Phe Phe Glu Gin Met Gin Asn 
690 695 



<210> 15 
<211> 2094 
<212> DNA 

<213> Homo sapiens 
<400> IS 

atgctgcccg gtttggcact gctcctgctg gccacctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgcgattc cctaccgctg cttagttggt 360 
gagtttgtaa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 54 0 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 84 0 
gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 
acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc cgcagtggcc 1440 
gaggagattc aggatgaagt tgacgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
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gaagttgagc ctgttgatgc ccgccccgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag cgaagatgga tgcagaattc 1800 
cgacatgact caggatatga agttcatcat caaaaattgg cgttctttgc agaagatgtg I860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaacagtaca catccattca tcatggtgtg 1980 
gtggaggttg acgccgctgt caccccagag gagcgccacc tgtccaagat gcagcagaac 2040 
ggctacgaaa atccaaccta caagttcttt gagcagatgc agaacaagaa gtag 2094 

<210> 16 

<211> 697 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr Hxs Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 



Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 
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Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Giu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 



Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 



Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Vai Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 

450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 



Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 



Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 

545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 
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Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 
580 585 590 



Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 



His His Gin Lys Leu Val Phe Phe Ala Giu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 

625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 17 

<211> 2094 

<212> DNA 

<213> Homo sapiens 

<400> 17 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
gagtttgtaa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 540 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 
acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaatctgga tgcagaattc 1800 
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cgacatgact caggatatga agttcatcat caaaaatcgg tgttctttgc agaagatgtq 1860 
ggttcaaaca aaggtgcaat. cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggt.gac gctgaagaag aaacagcaca catccattca tcatggtgtg 1980 
gtggaggttg acgccgctgc caccccagag gagcgccacc cgtccaagat gcagcagaac 2040 
ggctacgaaa atccaaccta caagttcttt gagcagatgc agaacaagaa gtag 2094 

<210> 18 
<211> 697 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

20 25 3C 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pre Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95. 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 lie 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

/ 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Giu Glu Asp Asp Ser Asp Val 
195 200 205 



Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 

225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 
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Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 



Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 

420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 



Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 

545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 



Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 
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Glu Val Asn Leu Asp Ala Glu 

595 



Phe Arg His Asp Ser Gly Tyr Glu Val 

600 605 



His His Gin Lys Leu Val Phe 
610 615 



Fhe Ala Glu Asp Val Gly Ser Asn Lys 
620 



Gly Ala He He Gly Leu Met 
625 630 



Val Gly Gly Val Val He Ala Thr Val 
635 640 



He Val He Thr Leu Val Met 
645 



Leu Lys Lys Lys Gin Tyr Thr Ser He 
650 655 



His His Gly Val Val Glu Val 
660 



Asp Ala Ala Val Thr Pro Glu Glu Arg 
665 670 



His Leu Ser Lys Met Gin Gin 
675 



Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
680 685 



Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 19 
<211> 2094 
<212> DNA 

<213> Homo sapiens 
<400> 19 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
gagtttgtaa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggacgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagrtccga 540 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 78C 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 
acacacatgg ccagagtgga agccacgctc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacct-ccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagriggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ctcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaagatgga tgcagaattc 1800 
cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcttcatca ccttggtgat gctgaagaag aaacagtaca catccattca tcatggtgtg 1980 
gtggaggttg acgccgctgt caccccagag gagcgccacc tgtccaagat gcagcagaac 2040 
ggctacgaaa atccaaccta caagttcttt gagcagatgc agaacaagaa gtag 2094 
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<210> 20 
<211> 697 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 " 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 



He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 15*0 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Va] 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 



Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 



100 



105 



lie 



275 



280 



285 
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Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 

290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 



Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arq Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Vai Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 



His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 
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Gly Ala lie He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Phe He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser lie 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 



Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 21 

<211> 1341 

<212> DNA 

<213> Homo sapiens 

<400> 21 

atggctagca tgactggtgg acagcaaatg ggtcgcggat ccacccagca cggcatccgg 6C 

ctgcccctgc gcagcggcct ggggggcgcc cccctggggc tgcggctgcc ccgggagacc 120 

gacgaagagc ccgaggagcc cggccggagg ggcagctttg tggagatggt ggacaacctg 180 

aggggcaagt cggggcaggg ctactacgtg gagatgaccg tgggcagccc cccgcagacg 240 

ctcaacatcc tggtggatac aggcagcagt aactttgcag tgggtgctgc cccccacccc 300 

ttcctgcatc gctactacca gaggcagctg tccagcacat accgggacct ccggaagggt 360 

gtgtatgtgc cctacaccca gggcaagtgg gaaggggagc tgggcaccga cctggtaagc 420 

atcccccatg gccccaacgt cactgtgcgt gccaacattg ctgccatcac tgaatcagac 480 

aagttcttca tcaacggctc caactgggaa ggcatcctgg ggctggccta tgctgagatt 540 

gccaggcctg acgactccct ggagcctttc tttgactctc tggtaaagca gacccacgtt 600 

cccaacctct tctccctgca cctttgtggt gctggcttcc ccctcaacca gtctgaagtg 660 

ctggcctctg tcggagggag catgatcatt ggaggtatcg accactcgct gtacacaggc 720 

agtctctggt atacacccat ccggcgggag tggtattatg aggtcatcat tgtgcgggtg 780 

gagatcaatg gacaggatct gaaaatggac tgcaaggagt acaactatga caagagcatt 84 0 

gtggacagtg gcaccaccaa ccttcgtttg cccaagaaag tgtttgaagc tgcagccaaa 900 

tccatcaagg cagcctcctc cacggagaag ttccctgatg gtttctggct aggagagcag S60 

ctggtgtgct ggcaagcagg caccacccct tggaacattt tcccagtcat ctcacictac 1020 

ctaatgggtg aggttaccaa ccagtccttc cgcatcacca tccttccgca gcaatacctg 1080 

cggccagtgg aagatgtggc cacgtcccaa gacgactgtt acaagtttgc catctcacag 1140 

tcatccacgg gcactgttat gggagctgtt atcatggagg gcttctacgt tgtctttgat 1200 

cgggcccgaa aacgaattgg ctttgctgtc agcgcttgcc atgtgcacga tgagttcagg 1260 

acggcagcgg tggaaggccc ttttgtcacc tcggacatgg aagactgtgg ctacaacatt 1320 

ccacagacag atgagtcatg a 1341 

<210> 22 

<21I> 446 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Thr Gin 
15 10 15 

His Gly He Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu 
20 25 30 

Gly Leu Arg Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu Glu Pro Gly 
35 40 45 
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Arg Arg Gly Ser Phe Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser 
50 55 60 

Gly Gin Gly Tyr Tyr Val Glu Met Thr Val Gly Ser Pro Pro Gin Thr 
65 70 75 80 

Leu Asn lie Leu Val Asp Thr Gly Ser Ser Asn Phe Ala Val Gly Ala 
85 90 95 

Ala Pro His Pro Phe Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser 
100 105 110 



Thr Tyr Arg Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly 
115 120 125 

Lys Trp Glu Gly Glu Leu Gly Thr Asp Leu val Ser He Pro His Gly 
130 135 140 

Pro Asn Val Thr Val Arg Ala Asn He Ala Ala He Thr Glu Ser Asp 
145 150 155 160 

Lys Phe Phe He Asn Gly Ser Asn Trp Glu Gly He Leu Gly Leu Ala 
165 170 175 

Tyr Ala Glu He Ala Arg Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp 
180 185 190 

Ser Leu Val Lys Gin Thr His Val Pro Asn Leu Phe Ser Leu His Leu 
195 200 205 

Cys Gly Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val 
210 215 220 

Gly Gly Ser Met He He Gly Gly He Asp His Ser Leu Tyr Thr Gly 

225 230 235 240 

Ser Leu Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He 
245 250 255 

He Val Arg Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys 
260 265 270 

Glu Tyr Asn Tyr Asp Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu 
275 280 285 

Arg Leu Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser He Lys Ala 
290 295 300 

Ala Ser Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin 
305 310 315 320 

Leu Val Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val 
325 330 335 

He Ser Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He 

340 345 350 

Thr He Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr 
355 360 365 



Ser Gin Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly 
370 375 380 
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Thr Val Met Gly Ala Val He Met Glu Gly Phe Tyr Val Val Phe Asp 
385 390 395 400 

Arg Ala Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys His Val His 
405 410 415 

Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp 
420 425 430 



Met Glu Asp Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser 
435 440 445 



<210> 23 
<211> 1380 
<212> DNA 

<213> Homo sapiens 
<400> 23 

atggctagca tgactggtgg acagcaaatg ggtcgcggat cgatgactat ctctgactct 60 
ccgcgtgaac aggacggatc cacccagcac ggcatccggc tgcccctgcg cagcggcctg 120 
gggggcgccc ccctggggct gcggctgccc cgggagaccg acgaagagcc cgaggagccc 180 
ggccggaggg gcagctttgt ggagatggtg gacaacctga ggggcaagtc ggggcagggc 240 
taccacgtgg agatgaccgt gggcagcccc ccgcagacgc tcaacaccct ggtggataca 300 
ggcagcagta actttgcagt gggtgctgcc ccccacccct tcctgcatcg ctactaccag 360 
aggcagctgt ccagcacata ccgggacctc cggaagggtg tgcatgtgcc ctacacccag 42C 
ggcaagtggg aaggggagct gggcaccgac ctggtaagca tcccccatgg ccccaacgtc 480 
actgtgcgtg ccaacattgc tgccatcact gaatcagaca agttcttcat caacggctcc 540 
aactgggaag gcatcctggg gctggcctat gctgagattg ccaggcctga cgactccctg 600 
gagcctttct ttgactctct ggtaaagcag acccacgttc ccaacctctt ctccctgcac 660 
ctttgtggtg ctggcttccc cctcaaccag tctgaagtgc tggcctctgt cggagggagc 720 
atgatcattg gaggtatcga ccactcgctg tacacaggca gtctctggta tacacccatc 780 
cggcgggagt ggtattatga ggtcatcatt gtgcgggtgg agatcaatgg acaggatctg 840 
aaaatggact gcaaggagta caactatgac aagagcattg tggacagtgg caccaccaac 900 
cttcgtttgc ccaagaaagc gtttgaagct gcagtcaaat ccatcaaggc agcctcctcc 960 
acggagaagt tccctgatgg tttctggcta ggagagcagc tggt.gtgctg gcaagcaggc 1020 
accacccctt ggaacatttt cccagtcatc tcactctacc taatgggtga ggttaccaac 1080 
cagtccttcc gcatcaccat ccttccgcag caatacctgc ggccagtgga agatgtggcc 1140 
acgtcccaag acgactgtta caagtttgcc atctcacagt catccacggg cactgttatg 1200 
ggagctgtta tcatggaggg cttctacgtt gtctttgatc gggcccgaaa acgaattggc 1260 
tttgctgtca gcgcttgcca tgtgcacgat gagttcagga cggcagcggt ggaaggccct 1320 
tttgtcacct tggacatgga agactgtggc tacaacattc cacagacaga tgagtcatga 1380 

<210> 24 

<211> 459 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met Thr 
15 10 15 



He Ser Asp Ser Pro Arg Glu Gin Asp Gly Ser Thr Gin His Gly He 
20 25 30 

Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg 
35 40 45 

Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly 
50 55 60 
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Ser Phe Vai Glu Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly 
65 70 75 SO 

Tyr Tyr Val Glu Met Thr Val Gly Ser Pro Pro Gin Thr Leu Asn He 
85 90 95 

Leu Val Asp Thr Gly Ser Ser Asn Phe Ala Val Giy Ala Ala Pro His 
100 105 110 

Pro Phe Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg 

115 120 125 

Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu 
130 " 135 140 

Gly Glu Leu Gly Thr Asp Leu Val Ser He Pro His Gly Pro Asn Val 
145 150 155 160 

Thr Val Arg Ala Asn He Ala Ala He Thr Glu Ser Asp I^ys Phe Phe 
165 370 175 

He Asn Gly Ser Asn Trp Glu Gly He Leu Gly Leu Ala Tyr Ala Glu 
180 185 190 

He Ala Arg Pro Asp Asp Ser lieu Glu Pro Phe Phe Asp Ser Leu Val 
195 200 205 

Lys Gin Thr His Val Pro Asn Leu Phe Ser Leu His Leu C>'s Gly Ala 
210 215 220 

Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser" 
225 230 235 240 

Met He He Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu Trp 
245 250 255 

Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val Arg 

260 265 270 

Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn 
275 280 285 

Tyr Asp Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro 
290 295 300 



Lys Lys Val Phe Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser Ser 

305 310 315 320 

Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys 

325 330 335 

Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser Leu 

340 345 350 

Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He Leu 

355 360 365 

Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin Asp 

370 375 380 

Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val Met 

385 390 395 400 
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Gly Ala Val lie Met GIj Gly ?he Tyr Val Val Phe Asp Arg Aia Arg 

405 410 415 

Lys Arg lie Gly Phe Ala Val 3er Ala Cys Kis Val His Asp Glu Phe 
420 425 430 

Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp 

435 440 445 

Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser 
450 455 



<210> 25 

<211> 1302 

<212> DNA 

<213> Homo sapiens 

<400> 25 

atgactcagc atggtattcg tctgccactg cgtagcggtc tgggtggtgc tccactgggt 60 
ctgcgtctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 120 
gtggagatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagatgacc 180 
gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 240 
gtgggtgctg ccccccaccc cttcctacat cgctactacc agaggcagct gtccagcaca 300 
taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 360 
ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 420 
gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 4 80 
gggctggcct atgctgagat tgccaggcct gacaactccc tggagccttt ctttgactct 540 
ctggtaaagc agacccacgt tcccaacctc ttctccctgc acctttgtgg tgctggcttc 600 
cccctcaacc agtctgaagt gctggcctct gccggaggga gcatgatcat tggaggtatc 660 
gaccactcgc cgtacacagg cagtctctgg tatacaccca tccggcggga gtggtattat 720 
gaggtcatca ttgtgcgggt ggagatcaat ggacaggatc tgaaaatgga ctgcaaggag 780 
tacaactatg acaagagcat tgtggacagt ggcaccacca accttcgttt gcccaagaaa 840 
gtgtttgaag ctgcagtcaa atccatcaag gcagcctcct ccacggagaa gttccctgat 900 
ggcttctgqc taggagagca gctggtcttgc tggcaagcag gcaccacccc ttggaacatt 960 
ttcccagtca tctcactcta cctaatgggt gaggttacca accagtcctt ccgcatcacc 1020 
atccttccgc agcaatacct gcggccagtg gaagatgtgg ccacgtccca agacgactgt 1080 
tacaagtttg ccatctcaca gtcatccacg ggcactgtta tgggagctgt tatcatggag 114C 
ggcttctacg ttgtctttga tcgggcccga aaacgaattg gctttgctgt cagcgcttgc 1200 
catgtgcacg atgagttcag gacggcagcg gtggaaggcc cttttgtcac cttggacatg 1260 
gaagactgtg gctacaacat tccacagaca gatgagtcat ga 1302 

<210> 26 

<2H> 433 

<212> PRT 

<213> Homo sapiens 

<400> 26 

Met Thr Gin His Gly He Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly 
15 10 15 

Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu 

20 25 30 

Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val Asp Asn Leu Arg 
35 40 45 

Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr Val Gly Ser Pro 
50 55 60 



Pro Gin Thr Leu Asn He Leu Val Asp Thr Gly Ser Ser Asn Phe Ala 
65 70 75 80 
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Val Gly Ala Ala Pro His Pre Phe Leu His Arg Tyr Tyr Gin Arg Gin 

85 90 95 

Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val Tyr Val' Pro Tyr 
100 * 105 110 

Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp Leu Val Ser lie 
115 120 125 

Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie Ala Ala lie Thr 
130 135 140 

Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp Glu Gly He Lea 
145 250 155 160 

Gly Leu Ala Tyr Ala Glu He Ala Arg Fro Asp Asp Ser Leu Glu Pro 

165 170 175 

Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro Asn Leu Phe Ser 
180 185 190 

Le\i His Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu 

195 200 205 

Ala Ser Val Gly Gly Ser Met He He Gly Gly He Asp His Ser Leu 
210 215 220 

Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg Glu Ti^ Tyr Tyr 
225 230 235 ' 240 

Glu Val He lie Val Arg Val Glu He Asn Gly Gin Asp Leu Lys Met 
245 250 255 

Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val Asp Ser Gly Thr 
260 265 270 

Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser 
275 280 285 

He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu 
290 295 300 

Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He 
305 310 315 320 

Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser 
325 330 335 

Phe Arg He Thr He Leu Pro Gin Gin T/r Leu Arg Pro Val Glu Asp 
340 345 350 

Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser 
355 360 365 

Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu Gly Phe Tyr Val 
370 375 380 

Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys 
385 390 395 400 

His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val 
405 410 415 
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Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn lie Pro Gin Thr Asp Glu 
420 425 430 

Ser 

<210> 27 
<211> 1278 
<212> DNA 

<213> Homo sapiens 



<400> 27 

atggctagca 

ccgctggact 

ggcaagtcgg 

aacatcctgg 

ctgcatcgct 

tatgtgccct 

ccccatggcc 

ttcttcatca 

aggcctgacg 

aacctcttct 

gcctctgtcg 

ctctggtata 

atcaatggac 

gacagtggca 

atcaaggcag 

gtgtgctggc 

atgggtgagg 

ccagtggaag 

tccacgggca 

gcccgaaaac 

gcagcggtgg 

cagacagatg 



tgactggtgg 
ctggtatcga 
ggcagggcta 
tggatacagg 
actaccagag 
acacccaggg 
ccaacgtcac 
acggctccaa 
actccccgga 
ccctgcacct 
gagggagcat 
cacccatccg 
aggatctgaa 
ccaccaacct 
cctcctccac 
aagcaggcac 
ttaccaacca 
atgtggccac 
ctgttatggg 
gaattggctt 
aaggcccttt 
agtcatga 



acagcaaatg 
aaccgacgga 
ctacgtggag 
cagcagtaac 
gcagctgtcc 
caagtgggaa 
tgtgcgcgcc 
ctgggaaggc 
gcctttcttt 
ttgtggtgct 
gatcattgga 
gcgggagtgg 
aatggactgc 
tcgtttgccc 
ggagaagttc 
caccccttgg 
gtccttccgc 
gtcccaagac 
agctgttatc 
tgctgtcagc 
tgtcaccttg 



ggtcgcggat 
tcctttgtgg 
atgaccgtgg 
tttgcagtgg 
agcacatacc 
ggggagctgg 
aacattgctg 
atcctggggc 
gactctctgg 
ggcttccccc 
ggtatcgacc 
tattatgagg 
aaggagtaca 
aagaaagtgt 
cct.gatggtt 
aacattttcc 
atcaccatcc 
gactgttaca 
atggagggct 
gcttgccatg 
gacatggaag 



cgatgactat 
agatggtgga 
gcaqcccccc 
gtgctgcccc 
gggacctccg 
gcaccgacct 
ccatcactga 
tggcctatgc 
taaagcagac 
tcaaccagtc 
actcgctgta 
tcatcattgt 
actatgacaa 
ttgaagctgc 
tctggctagg 
cagtcatctc 
ttccgcagca 
agtttgccat 
tctacgttgt 
tgcacgatga 
actgtggcta 



ctctgactct 
caacctgagg 
gcagacgctc 
ccaccccttc 
gaagggtgtg 
ggtaagcatc 
atcagacaag 
tgagattgcc 
ccacgttccc 
tgaagtgctg 
cacaggcagt 
gcgggtggag 
gagcattgtg 
agtcaaatcc 
agagcagctg 
actctaccta 
atacctgcgg 
ctcacactca 
ctttgatcgg 
gttcaggacg 
caacattcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1278 



<210> 28 
<211> 425 
<212> PRT 

<213> Homo sapiens 
<400> 28 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met Thr 
15 10 15 

lie Ser Asp Ser Pro Leu Asp Ser Gly lie Glu Thr Asp Gly Ser Phe 
20 25 30 

Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr 
35 40 45 

Val Glu Met Thr Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val 
50 55 60 

Asp Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe 
65 70 75 80 

Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu 

85 90 ' 95 

Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu 
100 105 110 



Leu Gly Thr Asp Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val 
115 120 125 
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Arg Ala Asn He Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn 
130 135 140 

Gly Ser Asn Trp Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu He Ala 
145 150 155 160 

Arg Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin 
165 170 * 175 

Thr His Val Pro Asn Leu Phe Ser Leu His Leu Cys Gly Ala Gly Phe 
180 185 190 

Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val G.ty Gly Ser Met He 
195 200 205 

He Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr 
210 215 220 

Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu 
225 230 235 240 

He Asn Gly Gin Asp 'Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp 
245 250 255 

Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys 
260 265 270 

Val Phe Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu 
275 280 285 

Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin 
290 295 300 

Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu 
305 310 315 320 

Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin 

325 330 335 

Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys 
340 345 350 

Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala 
355 360 365 

Val He Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg 
370 375 380 

He Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr 
385 390 395 400 

Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly 
405 410 415 

Tyr Asn He Pro Gin Thr Asp Glu Ser 
420 425 



<210> 29 

<2H> 1362 

<212> DNA 

<213> HoiTK) sapiens 
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<400> 29 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 

ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 

ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 

gtggagatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagacgacc 240 

gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taacttr.gca 300 

gtgggtgctg ccccccaccc cttcctccat cgctactacc agaggcagct gtccagcaca 360 

taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 

ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 4 80 

gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 540 

gggctggcct atgctgagat tgccagccct gacgactccc tggagccttt ctttgaccct 600 

ctggtaaagc agacccacgt tcccaacctc ttctccctgc acctttgtgg tgctggcttc 660 

cccctcaacc agtctgaagt gctggcctct gtcggaggga gcatgatcat tggaggtatc 720 

gaccactcgc tgtacacagg cagtctctgg tatacaccca tccggcggga gtggtattat 780 

gaggtcatca ttgtgcgggt ggagatcaat ggacaggatc tgaaaatgga ctgcaaggag 840 

tacaactatg acaagagcat tgtggacagt ggcaccacca accttcgttt gcccaagaaa 900 

gtgtttgaag ccgcagtcaa atccatcaag gcagcctcct ccacggagaa gttccctgat 960 

ggtttctggc taggagagca gctggtgtgc tggcaagcag ccaccacccc ttggaacatt 1020 

ttcccagtca tctcactcta cctaatgggt gaggttacca accagtcctt ccgcatcacc 1080 

atccttccgc agcaatacct gcggccagtg gaagatgtgg ccacgtccca agacgactgt 1140 

tacaagtttg ccatctcaca gtcatccacg ggcactgtta tgggagctgt tatcatggag 1200 

ggcttctacg ttgtctttga tcgggcccga aaacgaattg gctttgctgt cagcgcttgc 1260 

catgtgcacg atgagttcag gacggcagcg gtggaaggcc cttttgtcac cttggacatg 1320 

gaagactgtg gctacaacat tccacagaca gatgagtcat ga 1362 

<210> 30 

<211> 453 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Vax 
15 10 15 

Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser 
20 25 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 . 40 45 

Glu Glu Fro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 



Leu Val Ser He 
145 



Pro His Gly Pro Asn Val Thr 
150 155 



Val Arg Ala Asn He 
160 
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Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 176 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro 
195 200 205 

Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 



Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Giu He Asn Gly Gin 
260 265 270 • 

Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Pne Glu Ala 
290 295 300 

Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 

Giy Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg lie Thr He Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 330 

He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 



Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 



Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 

Gin Thr Asp Glu Ser 
450 



<210> 31 
<211> 1380 
<212> DNA 
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<213> Homo sapiens 



<400> 31 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 
ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 
ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagctt-t 180 
gtggagatgg cggacaacct gaggggcaag tcggggcagg gctactacgt ggagatgacc 240 
gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 300 
gtgggtgctg ccccccaccc cttcctgcat cgctactacc aqaggcagct gtccagcaca 360 
taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 
ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg cgccaacatt 480 
gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcccg 540 
gggct.ggcct atgctgagat tgccaggcct gaccactccc cggagccttt ctttgactct 600 
ctggtaaagc agacccacgt tcccaacctc ttct.ccctgc acctttgtgg tgctggctLc 660 
cccctcaacc agtctgaagt gctggcctct gtcggaggga gcatgatcat tggaggta.cc 720 
gaccactcgc tgtacacagg cagtctctgg tatacaccca cccggcggga gtggtattat 780 
gaggtcatca ttgtgcgggt ggagatcaat ggacaggatc tcaaaatgga ctgcaaggag 84 0 
tacaactatg acaagagcat tgtggacagt ggcaccacca accttcgttt gcccaagaaa 900 
gtgtttgaag ctgcagtcaa atccatcaag gcagcctcct ccacggagaa gttccctgat 960 
ggtttctggc taggagagca gctggtgtgc tggcaagcag gcaccacccc ttggaacatt 102 0 
ttcccagtca tctcactcta cctaatgggt gaggttacca accagtcctt ccgcatcacc 1080 
atccttccgc agcaatacct gcggccagtg gaagatgtgg ccacgtccca agacgactgt 114 0 
tacaagtttg ccatctcaca gtcatccacg ggcactgtta tgggagctgt tatcatggag 1200 
ggcttctacg ttgtctttga tcgggcccga aaacgaattg gctttgctgt cagcgcttgc 1260 
catgtgcacg atgagttcag gacggcagcg gtggaaggcc ctttcgtcac cttggacatg 1320 
gaagactgtg gctacaacat tccacagaca gatgagtcac agcagcagca gcagcagtga 1380 

<210> 32 
<211> 459 
<212> PRT 

<213> Homo sapiens 
<400> 32 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Val 
15 10 15 

Leu Pro Ala His Gly Thr Gin His Gly lie Arg Lea Pro Leu Arg Ser 

20 25 , 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Asg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro Kis Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 " 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 



Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 
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Ala Ala lie Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro 
195 200 205 

Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 



Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 

225 230 . 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 

Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 

Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 

305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 



Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin Gin Tyr Leu Arg 

355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 330 

He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 



Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 



Gin Thr Asp Glu Ser His His His His His His 
450 455 



<210> 33 
<211> 25 
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<212> PRT 

<213> Homo sapiens 

<400> 33 

Ser Glu Gin Gin Arg Arg Pro Arg Asp Pro Glu Val Val Asn Asp Glu 
1 5 * 10 15 

Ser Ser Leu Val Arg His Arg Trp Lys 
20 25 



<210> 34 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Ser Glu Gin Leu Arg Gin Gin His Asp Asp Phe Ala Asp Asp lie Ser 
15 10 15 

Leu Leu Lys 



<210> 35 
<211> 29 
<212> DNA 

<213> Homo sapiens 
<4C0> 35 

gtggatccac ccagcacggc atccggctg 29 

<210> 36 

<211> 36 

<212> DNA 

<213> Homo sapiens 

<400> 36 

gaaagctttc atgactcatc tgtctgtgga atgttg 36 

<210> 37 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<400> 37 

gatcgatgac tatctctgac tctccgcgtg aacaggacg 39 

<210> 38 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<400> 38 

gatccgtcct gttcacgcgg agagtcagag acagtcatc 39 

<210> 39 
<211> 77 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: HU"Asp2 
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<400> 39 

cggcatccgg ctgcccctgc gtagcggtcc gggtggtgct ccactgggtc tgcgtctgcc 60 
ccgggagacc gacgaag 77 

<2iO> 40 
<2ia> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2 
<400> 40 

cttcgtcggt ctcccggggc agacgcagac ccaotggagc accacccaga ccgctacgca 60 
ggggcagccg gatgccg 77 

<210> 41 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 8 
Cleavage Site 

<400> 41 

gatcgatgac tatctctgac tctccgctgg actctggtat cgaaaccgac g 51 

<210> 42 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 8 
Cleavage Site 

<400> 42 

gatccgtcgg tttcgatacc agagtccagc ggagagtcag agatagtcat c 51 

<2i0> 43 
<211> 32 
<212> DNA 

<213> Homo sapiens 
<400> 43 

aaggatcctt tgtggagatg gtggacaacc tg 32 

<210> 44 

<211> 36 

<212> DNA 

<213> Homo sapiens 

<400> 44 

gaaagctttc atgactcatc tgtctgtgga atgttg 36 

<210> 45 

<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 6-His tag 



47 

iccac tcgaccaggt tc 



22 



48 
51 
DNA 

Artificial Seauence 



Description of Artificial Sequence: primer 
48 

laaat tccagcacac tggctactcc ttgttctgca tctcaaagaa c 51 

49 
26 
DNA 

Artificial Seauence 



Description of Artificial Sequence: primer 
49 

taaat tccagcacac tggcta 26 
50 

1287 
DNA 

Artificial Sequence 



Description of Artificial Sequence: Hu-Asp2(b) 
delta TM 

50 

ccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 
ccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 
gctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 
gatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagatgacc 240 
cagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 300 
tgctg ccccccaccc cttcctgcat cgctactacc agaggcagct gtccagcaca 360 
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24 



45 

atca tcaccatcac cacg 

46 
24 
DNA 

Artificial Sequence 



Description of Artificial Sequence: 6-His tag 
46 

itggt gacggtgacg atgc 24 

47 

22 
DNA 

Artificial Sequence 



Description of Artificial Sequence: primer 
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taccgggacc cccggaaggg tgtgtatg^g ccctacaccc agggcaagtg ggaaggggag 420 

ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 480 

gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 540 

gggctggcct atgctgagat tgccaggctt tgtggtgcng gcttccccct caaccagtct 600 

gaagtgctgg cctctgtcgg agggagcatg atcactggag gtatcgacca ctcgctgtac 660 

acaggcagtc tctggtatac acccatccgg cgggagtggt attatgaggt catcattgtg 720 

cgggtggaga tcaatggaca ggatctgaaa atggactgca aggagtacaa ctatgacaag 780 

agcattgtgg acagtggcac caccaacctt cgtttgccca agaaagtgtt tgaagctgca 840 

gtcaaatcca tcaaggcagc ctcctccacg gagaagttcc ctgatggttt ctggctagga 900 

gagcagctgg tgtgctggca agcaggcacc accccttgga acattttccc agtcatctca 960 

ctceacctaa tgggtgaggt taccaaccag tcctcccgca tcaccatcct tccgcagcaa 1020 

tacctgcggc cagtggaaga tgtggccacg tcccaagacg actgttacaa gtttgccatc 1080 

tcacagtcat ccacgggcac tgttatggga gctgttatca tqgagggctt ctacgttgtc 1140 

tttgatcggg cccgaaaacg aattggcttt gctgccagcg cttgccatgt gcacgacgag 1200 

ttcaggacgg cagcggtgga aggccctttt gtcaccttgg acatggaaga ctgtggctac 1260 

aacattccac agacagatga gtcatga 1287 

<210> 51 
<211> 428 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2(b) 
delta TM 

<400> 51 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Val 
1 5 10 15 

Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser 

20 25 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr T>'r Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 

165 170 175 



Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Leu Cys Gly 
180 185 190 
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Ala Gly Phe Pro Leu Asn Gin £Jer Glu Val Leu Ala Ser Val Gly Gly 
195 200 205 

Ser Met lie He Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu 
210 215 220 

Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Giu Val I3e He Val 
225 230 235 240 

Arg Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr 

245 250 255 

Asn Tyr Asp Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu 
260 265 270 

Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser 

275 280 285 

Ser Thr Giu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Vai 
290 295 300 

Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser 
305 310 315 320 

Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He 
325 330 335 

Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin 
340 345 350 

Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val 

355 360 365 

Met Gly Ala Val lie Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala 
370 375 380 

Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu 
385 390 395 400 

Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Giu 
405 410 415 



Asp Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser 
420 425 



<210> 52 
<211> 1305 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2 (b) 
delta TM 

<400> 52 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 

ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccccgggg 120 

ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 

gtggagatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagacgacc 240 

gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactctgca 300 

gtgggtgctg ccccccaccc cttcccgcat cgctactacc agaggcagct gtccagcaca 360 

taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 
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ctgggcaccg acctggcaag catcccccat ggccccaacg tcactgtgcg tgccaacatc 4 80 
gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 54 0 
gggctggcct atgctgagat cgccaggctt tgtggtgctg gcttcccccc caaccagtct 600 
gaagtgctgg cctctgtcgg agggagcatg atcattggag gtatcgacca ctcgctgtac 660 
acaggcagtc tctggtatac acccatccgg cgggagtggt attatgaggt catcattgtg 720 
cgggtggaga tcaatggaca ggatctgaaa atggactgca aggagtacaa ctatgacaag 780 
agcattgtgg acagtggcac caccaacctt cgtttgccca agaaagtgtt tgaagctgca 840 
gtcaaatcca tcaaggcagc ctcctccacg gagaagttcc ctgatggttt ctggctagga 900 
gagcagctgg tgtgctggca agcaggcacc accccttgga acattttccc agtcatctca 960 
ctctacctaa tgggtgaggc taccaaccag tccttccgca tcaccatcct tccgcagcaa 1020 
tacctgcggc cagtggaaga tgtggccacg tcccaagacg actgttacaa gtttgccatc 1080 
tcacagtcat ccacgggcac tgttatggga gctgttatca tggagggctt ctacgttgtc 1140 
tttgatcggg cccgaaaacg aattggcttt gctgt.cagcg cttgccatgt gcacgatgag 1200 
ttcaggacgg cagcggtgga aggccctttt gtcaccttgg acatggaaga ctgtggctac 1260 
aacattccac agacagatga gtcacagcag cagcagcagc agtga 1305 

<210> 53 
<211> 434 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2(b) 
delta TM 

<400> 53 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Val 
15 10 15 

Leu Pro Ala His Gly Thr Gin His Gly He Arg Leu Pro Leu Arg Ser 
20 25 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 

65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 



Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 

130 135 140 

Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 X50 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 . 175 



Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Leu Cys Gly 
180 185 190 
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Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly 
195 200 205 

Ser Met lie lie Gly Gly lie Asp His Ser Leu Tyr Thr Gly Ser Leu 
210 215 220 

Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val lie He Val 
225 230 235 240 

Arg Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr 

245 250 255 



Asn Tyr Asp Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu 
260 265 270 

Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser He Lys Aia Ala Ser 
275 280 285 

Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val 
290 295 ' 300 

Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser 
305 310 315 320 

Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr I]e 
325 330 335 

Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin 

340 345 350 

Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val 
355 360 365 

Met Gly Ala Val He Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala 
370 375 380 



Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu 
385 390 395 400 

Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu 
405 410 415 

Asp Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser His His His His 
420 425 430 

His His 



<210> 54 
<211> 2310 
<212> DNA 

<213> Homo sapiens 
<400> 54 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
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gaatttgtaa gtgatgccct tctcgttcct gacaagtgca aaitcttaca ccaggagagg 420 
atggatgttt qcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgci.gccct gcggaattga caagttccga 54 0 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag aracagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggr ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 760 
ccctacgaag aagccacaga gagaaccacc agcactgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagaggtg tgctctgaac aagccgagac ggggccgtgc 900 
cgagcaacga tctcccgctg gtactttgat gtgactgaag ggaagtgtgc cccattcttt 960 
tacggcggat gtggcggcaa ccggaacaac tttgacacag aagagtactg catggccgtg 1020 
tgtggcagcg ccatgtccca aagtttactc aagactaccc aggaacctct tggccgagat 1080 
cctgttaaac tr.cctacaac agcagccagt acccctgatg ccgttgacaa gtatctcgag 1140 
acacccgggg atgagaatga acatgcccat ttccagaaag ccaaagagag gcttgaggcc 1200 
aagcaccgag agagaatgtc ccaggtcatg agagaatggg aagaggcaga acgtcaagca 1260 
aagaacttgc ctaaagctga taagaaggca gttatccagc acttccagga gaaagtggaa 1320 
tctttggaac aggaagcagc caacgagaga cagcagctgg tggagacaca canggccaga 1380 
gtggaagcca tgctcaatga ccgccgccgc ctggccctgg agaactacat caccgctctg 1440 
caggctgttc ctcctcggcc tcgtcacgtg ttcaatatgc taaagaagta tgtccgcgca 1500 
gaacagaagg acagacagca caccctaaag cattucgagc atgtgcgcat ggtggacccc 1560 
aagaaagccg ctcagatccg gtcccaggtt atgacacacc tccgtgtgat ttatgagcgc 1620 
atgaatcagt ctctctccct gctctacaac gtgcctgcag tggccgagga gattcaggat 1680 
gaagttgatg agctgcttca gaaagagcaa aactattcag atgacgtctt ggccaacatg 174 0 
attagtgaac caaggatcag ttacggaaac gacgctctca tgccatcttt gaccgaaacg 1800 
aaaaccaccg tggagctcct tcccgtgaat ggagagttca gcctggacga tctccagccg 1860 
tggcattctt ttggggctga ctctgtgcca gccaacacag aeaacgaagt tgagcctgtt 1920 
gatgcccgcc ctgctgccga ccgaggactg accactcgac caggttctgg gttgacaaat 1980 
atcaagacgg aggagatctc tgaagtuaag atggatgcag aattccgaca tgactcagga 204 0 
tatgaagttc atcatcaaaa attggtgttc tttgcagaag atgtgggttc aaacaaaggt 2100 
gcaatcattg gactcatggt gggcggtgtt gtcatagcga cagtgatcgt catcaccttg 2160 
gtgatgctga agaagaaaca gtacacatcc attcatcatg gtrgtggtgga ggttgacgcc 2220 
gctgtcaccc cagaggagcg ccacctgtcc aagatgcagc agaacggcta cgaaaatcca 2280 
acctacaagt tctttgagca gatgcagaac 2310 

^210> 55 
<211> 770 
<212> PRT 

<213> Homo sapiens 
<400> 55 

Met Leu Fro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 HO 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 



wo 01/49097 



-48- 



PCT/IBOl/00797 



Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 ' 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro lyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val C>'s Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
325 330 335 

Cys Met Ala Val Cys Gly Ser Ala Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 

Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 
370 375 380 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 
405 410 415 

Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He 
420 425 430 

Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn 
435 440 445 



Glu Arg Gin Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met 
450 455 460 
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Leu Asn Asp Arg Arg Arg Leu Aia 
465 470 

Gin Ala Val Pro Pro Arg Pro Arg 
485 

Tyr Val Arg Ala Giu Gin Lys Asp 
500 

Glu His Val Arg Met Val Asp Pro 
515 520 
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Leu Glu Asn Tyr He Thr Ala Leu 
475 480 

His Val Phe Asn Met Leu Lys Lys 
490 495 

Arg Gin His Thr Leu Lys His Phe 
505 510 

Lys Lys Ala Ala Gin He Arg Ser 
525 



Gin Val Met Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser 
530 535 £40 

Leu Ser Leu Leu Tyr Asn Val Fro Ala Val Ala Glu Glu lie Gin Asp 
545 550 555 560 

Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val 

565 570 575 

Leu Ala Asn Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala 
580 585 590 

Leu Met Pro Ser Leu Thr Glu Tnr Lys Thr Thr Val Glu Leu Leu Pro 
595 600 605 

Val Asn Gly Glu Phe Ser Leu. Asp Asp Leu Gin Pro Trp His Ser Phe 
610 615 620 

Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val 
625 630 635 640 

Asp" Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser 
645 650 655 

Gly Leu Thr Asn He Lys Thr Glu Glu I?.e Ser Glu Val Lys Met Asp 
660 665 670 

Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu 

675 630 685 

Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He Gly 
690 695 700 

Leu Met Val Gly Gly Val Val He Ala Thr Val He Val He Thr Leu 
705 710 715 720 

Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val 
725 730 735 

Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met 
740 745 750 

Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met 
755 760 765 



Gin Asn 
770 
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<210> 56 
<21i> 2253 
<212> DNA 

<213> Homo sapiens 
<400> 56 

atgctgcccg gtttggcact gctcccoctg gccgcctgga cggct.cgggc gctggaggta 60 

cccactgatg gtaacgctgg cctgctggct: gaaccccaga ttgccatgtt ctgtggcaga 120 

ctgaacatgc acatgaatgt ccagaatggg aag::gggatt cagatccatc agggaccaaa 180 

acccgcattg ataccaagga aggcaccccg cagtattgcc aagaagtcta ccctgaactg 240 

cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 

ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg ctcagttggt 360 

gagtttgtaa gtgatgcccr. tcccgcccct gacaagtgca aattcttaca ccaggagagg 420 

atggatgttt gcgaaactca tcttcacT:gg cacaccgtcg ccaaagagac atgcagtgag 4 80 

aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 54 0 

ggggtagagt tligtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgaL 600 

gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 

agtgaagaca aagcagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 

gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 

ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 

gagtctgtgg aagaggtggt tcgagaggtg tgctctgaac aagccgagac ggggccgtgc 900 

cgagcaatga tctcccgctg gtactttgat gtgactgaag ggaagtgtgc cccattcttt 960 

tacggcggat gnggcggcaa ccggaacaac tttgacacag aagagtactg catggccgtg 1020 

tgtggcagcg ccattcctac aacagcagcc agtacccctg acgccgttga caagtatctc 1080 

gagacaccLg gggatgagaa tgaacatgcc catttccaga aagccaaaga gaggcttgag 114 0 

gccaagcacc gagagagaat gtcccaggtc atga^agaat gggaagaggc agaacgtcaa 1200 

gcaaagaact cgcctaaagc tgataaqaag gcagttatcc agcatttcca ggagaaagtg 1260 

gaatctttgg aacaggaagc agccaac:gag agacagcagc tggtggagac acacatggcc 1320 

agagtggaag ccatgctcaa tgaccgccgc cgcctggccc tggagaacta catcaccgct 1380 

ctgcaggctg ctcctcctcg gcctcgccac gtgttcaata tgctaaagaa gtatgtccgc 1440 

gcagaacaga aggacagaca gcacacccta aagcatttcg agcatgtgcg catggtggat 1500 

ccc'aagaaag ccgctcagat ccggtcccag gttargacac acctccgtgt gatttatgag 1560 

cgcatgaatc agtctctctc cctgctctac aacgtgcccg cagtggccga ggagattcag 1620 

gat^aagttg atgagctgct tcagaaagag caaaactatt cagatgacgt cttggccaac 1680 

atgattagtg aaccaaggat cagttacgga aacgatgctc tcatgccatc tttgaccgaa 1740 

acgaaaacca ccgtggagct ccttcccgtg aatggagagt tcagcctgga cgatctccag 1800 

ccgtggcatt cttttggggc tgactctgtg ccagccaaca cagaaaacga agttgagcct 1860 

gttgatgccc gccctgctgc cgaccgagga ctgaccactc gaccaggttc tgggttgaca 1920 

aatatcaaga cggaggagat ctctgaagtg aagatggatg cagaattccg acatgactca 1980 

ggatatgaag ttcatcatca aaaattggtg ttctttgcag aagatgtggg ttcaaacaaa 2040 

ggtgcaatca ttggactcat ggtgggcggt gttgtcatag cgacagtgat cgtcatcacc 2100 

ttggtgatgc tgaagaagaa acagtacaca tccaetcatc atggtgtggt ggaggttgac 2160 

gccgctgtca ccccagagga gcgccacctg tccaagatgc agcagaacgg ccacgaaaat 2220 
ccaacctaca agttctttga gcagatgcag aac 2253 

<210> 57 
<21i> 751 
<212> PRT 

<213> Homo sapiens 
<400> 57 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Aia Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 



Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 
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Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 



Val Pro Asp Lys Cys Lys Phe Leu His GJn Glu Arg Met Asp Val Cys 
130 135 I'iO 

Glu Thr His Leu His Trp His Thr Val AJ a Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 : 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Giu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 22: 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Giy Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
325 330 335 

Cys Met Ala Val Cys Gly Ser Ala He Pro Thr Thr Ala Ala Ser Thr 
340 345 350 

Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu 

355 360 365 

His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg 
370 375 380 

Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gin 
385 390 395 400 
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Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val lie Gin His Phe 
405 410 415 

Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn Glu Arg Gin 
420 425 430 

Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met Leu Asn Asp 
435 440 445 

Arg Arg Arg Leu Ala Leu Glu Asn Tyr lie Thr Ala Leu Gin Ala Val 
450 455 460 

Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys Tyr Val Arg 
465 470 475 480 

Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe Glu His Val 
485 490 495 

Arg Met Val Asp Pro Lys Lys Ala Ala Gin lie Arg Ser Gin Val Met 
500 505 510 

Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser Leu Ser Leu 
515 520 525 

Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp Glu Val Asp 
530 535 540 

Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val Leu Ala Asn 
545 550 555 560 

Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala Leu Met Pro 

565 570 575 

Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Lea Pro Val Asn Gly 
580 585 590 

Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe Gly Ala Asp 
595 600 605 

Sei Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg 
610 615 620 

Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser Gly Leu Thr 
625 630 635 640 

Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp Ala Glu Phe 
645 650 655 

Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu Val Phe Phe 
660 665 670 

Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He lie Gly Leu Met Val 

675 680 685 

Gly Gly Val Val He Ala Thr Val He Val He Thr Leu Val Met Leu 
690 695 700 

Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val Glu Val Asp 
705 710 715 720 

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 
725 730 735 
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Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gl;i Asn 
740 745 750 



<210> 58 
<212> 2316 
<212> DNA 

<213> Homo sapiens 
<400> 58 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 

cccactgatg gtaatgctgg cctgctogct gaaccccaga ttgccatgtt ctgtggcaga 120 

ctgaacatgc acatgaangt ccagaacggg aagtgggatt caqatccatc agggaccaaa 180 

accLgcattg araccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 

cagatcacca atgtggtaga agccaaccaa ccagcgacca tccagaactg gtgcaagcgg 300 

ggccgcaagc agtgcaagac ccacccccac tttgt-gattc cctaccgctg cttagttugt 360 

gagtttgtaa gtgatgccct tctcgtr.ccr gacaagtgca aattctcaca ccaggagagg 420 

atggatgtti: gcgaaactca ccttcactgc cacaccgtcg ccaaagagac atgcagtcag 4 80 

aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaactga caagttccga 540 

ggggtagagt ttgtgtgttg cccactggcc gaagaaagtg acaatgcgga ttctgctgat 600 

gcggaggagg augactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 

agtgaagaca aagtagtaga agtagcagag gaggaagaag tcgctgaggt ggaagaagaa 720 

gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 

ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 84 0 

gagtctgtgg aagaggtggt tcgagaggtg tgctctgaac aagccgagac ggggccgtgc 900 

cgagcaatga cctcccgctg gtactttgat gtgactgaag ggaagtgtgc cccattctr.t 960 

tacggcggat gtggcggcaa ccggaacaac tttgacacag aagagtactg catggccgr.g 1020 

t.gtggcagcg ccatgtccca aagtttactc aagactaccc aggaacctct tggccgagat 1080 

cctgttaaac tccctacaac agcagccagt acccctgatg ccgtt.gacaa gtatctcgag 1140 

acacctgggg atgagaatga acatgcccat ttccagaaag ccaaagagag gcttgaggcc 1200 

aagcaccgag agagaatgtc ccaggtcatg agagaatggg aagaggcaga acgtcaagca 1260 

aacaacttgc ctaaagctga taagaaggca gttatccagc atttccagga gaaagtggaa 1320 

tctttggaac aggaagcagc caacgagaga cagcagctgg tggagacaca catggccaga 1380 

gtggaagcca tgctcaatga ccgccgccgc ctggccctgg agaactacat caccgctccc 1440 

.caggctgttc ctcctcggcc tcgtcacgtg ttcaatatgc taaagaagta tgtccgcgca 1500 

gaacagaagg acagacagca caccctaaag catttcgagc atgtgcgcat ggtggatccc 1560 

aagaaagccg ctcagatccg gtcccaagtt atgacacacc tccgtgtgat ttargagcgc 1620 

atgaatcagt c»:ctctccct gctctacaac gtgcctgcag tggccgagga gattcaggat 1680 

gaagttgatg agctgcttca gaaagagcaa aactattcag atgacgtctt ggccaacatg 1740 

attagtgaac caaggaccag ttacggaaac gatgctctca tgccatcttt gaccgaaaca 1800 

aaaaccaccg tggagctcct tcccgtgaat ggagagttca gcctggacga tctccagccg 1860 

tggcattctt ttggggctga ctctgtgcca gccaacacag aaaacgaagt tgagcctgtt 1920 

gatgcccgcc ctgctgccga ccgaggactg accactcgac caggttctgg gttgacaaat 1980 

atcaagacgg aggagatctc tgaagtgaag atggatgcag aattccgaca tgactcagga 2040 

tatgaagttc atcatcaaaa attggtcttc tttgcagaag atgtgggttc aaacaaaggt 21C0 

gcaatcattg gactcatggt gggcggtgtt gtcat.agcga cagcgatcgt catcaccttg 2160 

gtgatgctga agaagaaaca gtacacatcc attcatcatg gtgtggtgga ggttgacgcc 2220 

gctgtcaccc cagaggagcg ccacctgtcc aagacgcagc agaacggcta cgaaaatcca 2280 

acctacaagt cctttgagca gatgcagaac aagaag 2316 

<210> 59 
<211> 772 
<212> PRT 

<213> Homo sapiens 
<400> 59 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 • 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 
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Gln lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Vai Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 

260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 



Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 



Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
325 330 335 

Cys Met Ala Val Cys Gly Ser AJa Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 
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Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 

370 375 380 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 
405 410 415 

Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val lie 
420 425 430 

Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn 
435 440 445 

Glu Arg Gin Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met 

450 455 460 



Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu 
465 470 475 480 

Gin Ala Val Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys 
485 490 495 

Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe 
500 505 510 

Glu His Val Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser 
515 520 525 

Gin Val Met Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser 
530 535 540 

Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp 
545 550 555 560 

Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val 
565 570 575 

Leu Ala Asn Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala 
580 585 590 

Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro 
595 600 605 

Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe 
610 615 620 

Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val 
625 630 635 640 

Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser 
645 650 655 

Gly Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp 
660 665 670 

Ala Glu Phe Arg His Asp Ser Gly Tyr Gju Val His His Gin Lys Leu 
675 680 685 



Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He Gly 
690 695 700 
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Leu Met Val Gly Gly Val Vai He 
705 710 

Vai Met Leu Lys Lys Lys Gin Tyr 
725 

Glu Val Asp Ala Ala Val Thr Pro 
740 



Ala Thr Val lie Val He Thr Leu 

715 720 

Thr Ser He His His Gly Val Val 
730 735 

Glu olu Arg His Leu Ser Lys Met 
745 750 



Gin Gin Asn Gly Tyr Glu Asn Fro Thr Tyr Lys Phe Phe Glu Gin Met 
755 760 765 

Gin Asn Lys Lys 
770 



<21C> 60 

<211> 2259 

<2i2> DNA 

<213> Homo sapiens 

<400> 60 

atgctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
ctgaacatgc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcatcctg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttgtaa gtgatgccct tctcgttcct 
atggatgttt gcgaaactca tcttcartgg 
aagagtacca acttgcatga ctacggcatg 
ggggtagagt ttgtgtgttg cccactggct 
gcggaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaca agtagcagag 
gaagccgacg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagaggtg 
cgagcaatga tctcccgctg gtactttgat 
tacggcggat gtggcggcas ccggaacaac 
tgtggcagcg ccattcctac aacagcagcc 
gagacacctg gggatgagaa tgaacatgcc 
gccaagcacc gagagagaat gtcccaggtc 
gcaaagaact rgcctaaagc tgataagaag 
gaatctttgg aacaggaagc agccaacgag 
agagtggaag ccatgctcaa tgaccgccgc 
ctgcaggctg ttcctcctcg gcctcgtcac 
gcagaacaga aggacagaca gcacacccta 
cccaagaaag ccgctcagat ccggtcccag 
cgcatgaatc agtctctctc cctgctctac 
gatgaagttg atgagctgct tcagaaagag 
atgattagtg aaccaaggat cagttacgga 
acgaaaacca ccgtggagct ccttcccgtg 
ccgtggcatt cttttggggc tgactctgtg 
gttgatgccc gccctgctgc cgaccgagga 
aatatcaaga cggaggagat ctctgaagtg 
ggatatgaag ttcatcatca aaaattggtg 
ggtgcaatca ttggactcat ggtgggcggt 
ttggtgatgc tgaagaagaa acagtacaca 
gccgctgtca ccccagagga gcgccacctg 
ccaacctaca agttctttga gcagatgcag 



gccgcctgga cggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgcgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct gcggaattga caagttccga 540 
gaagaaagtg acaatgtgga ttctgctgat 6C0 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgatgagg tagaggaaga ggctgaggaa 780 
agcattgcca ccaccaccac caccaccaca 84 0 
tgctctgaac aagccgagac ggggccgtgc 900 
gtgactgaag ggaagtgtgc cccattcttt 960 
tttgacacag aagagtactg catggccgtg 102C 
agtacccctg atgccgttga caagtatctc 1080 
catttccaga aagccaaaga gaggcttgag 1140 
atgagagaat gggaagaggc agaacgtcaa 1200 
gcagttatcc agcatttcca ggagaaagtg 1260 
agacagcagc tggtggagac acacatggcc 1320 
cgcctggccc tggagaacta catcaccgct 1380 
gtgttcaata tgctaaagaa gtatgtccgc 1440 
aagcatttcg agcatgtgcg catggtggat 1500 
gttatgacac acctccgtgt gatttatgag 1560 
aacgtgcctg cagtggccga ggagattcag 1620 
caaaactatt cagatgacgt cttggccaac 1680 
aacgacgctc tcatgccatc tttgaccgaa 1740 
aatggagagt tcagcctgga cgatctccag 1800 
ccagccaaca cagaaaacga agttgagcct 1860 
ctgaccactc gaccaggttc tgggttgaca 1920 
aagatggatg cagaattccg acatgactca 1980 
tcctttgcag aagatgtggg ttcaaacaaa 2040 
gttgtcatag cgacagtgat cgtcatcacc 2100 
tccattcatc atggtgtggt ggaggttgac 2160 
tccaagatgc agcagaacgg ctacgaaaat 2220 
aacaagaag 2259 
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<210> 61 
<21i> 753 
<212> PRT 

<213> Homo sapiens 
<400> 61 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Giy Thr Lys Thr Cys lie Asp 

50 ' 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gill He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Tro Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val C^'s 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 

195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Giy Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 



Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 



Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 
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Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
325 330 335 

Cys Met Ala Val Cys Gly Ser Ala He Pro Thr Thr Ala Ala Ser Thr 
340 345 350 

Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu 
355 360 365 

His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala lys His Arg 
370 375 380 

Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gin 
385 390 395 400 

Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He Gin His Phe 
405 410 415 

Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn Glu Arg Gin 
420 425 430 

Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met Leu Asn Asp 
435 440 445 

Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu Gin Ala Val 
450 455 460 

Pro Pro Arg Pro Arg His Val Fhe Asn Met Leu Lys Lys Tyr Val Arg 
465 470 475 480 

Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe Glu His Val 
485 490 495 

Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser Gin Val Met 
500 505 510 

Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser Leu Ser Leu 
515 520 525 

Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp Glu Val Asp 
530 535 540 

Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val Leu Ala Asn 
545 550 555 560 



Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala Leu Met Pro 

565 570 575 

Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly 
580 585 590 

Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe Gly Ala Asp 
595 600 605 

Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg 
610 615 620 

Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser Gly Leu Thr 
625 630 635 640 
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Asn lie Lys Thr Glu Glu lie Ser 
645 

Arg His Asp Ser Gly Tyr Glu Val 
660 

Ala Glu Asp Val Gly Ser Asn Lys 

675 680 

Gly Gly Val Val lie Ala Thr Val 
690 695 

Lys Lys Lys Gin Tyr Thr Ser lie 
705 710 

Ala Ala Val Thr Pro Glu Glu Arg 
725 

Gly Tyr Glu Asn Pro Thr Tyr Lys 
740 

Lys 



-59- 

Glu Val Lys Met Asp Ala Glu Phe 

650 655 

His His Gin Lys Leu Val Phe Phe 
665 ' 670 

Gly Ala lie lie Gly Leu Met Val 
685 

lie Val lie Thr Leu Val Met Leu 
700 

His His Gly Val Val Glu Val Asp 
715 720 

His Leu Ser Lys Met Gin Gin Asn 
7i0 735 

Phe Phe Glu Gin Met Gin Asn Lys 
745 750 



<210> 62 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
<400> 62 

Leu Glu Val Leu Phe Gin Gly Pro 
1 5 



<210> 63 
<2H> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
<400> 63 

Ser Glu Val Asn Leu Asp Ala Glu Phe Arg 
1 . 5 iO 



<210> 64 
<211> 10 
<2i2> PRT 

<213> Artificial Sequence 
c220> 

<223> Description of Artificial Sequence- synthetic 
c400> 64 

Ser Glu Val Lys Met Asp Ala Glu Phe Arg 
15 10 
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<210> 65 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
<4O0> 65 

Arg Arg Gly Gly Val Val He Ala Thr Val He Val Gly Glu Arg 
15 10 15 



<210> 66 
<211> 4 
<212> PRT 

<2i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 

<4C0> 66 
Asn Leu Asp Ala 
1 



<210> 67 
<211> 8 
<2i2> PRT 

<213> Artificial Sequence 

<220> 

c223> Description of Artificial Sequence: Peptide 
<400> 67 

Glu Val Lys Met Asp Ala Glu Phe 
1 5 



<210> 68 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 68 

Gly Arg Arg Gly Ser 
1 5 



<210> 69 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 



<400> 69 
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Thr Gin His Gly lie Arg 
I 5 



<210> 70 
<211> 6 
<212> PRT 

<2i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 70 

Glu Thr Asp Glu Glu Pro 
] 5 



<210> 71 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<40D> 7: 

Met Cys Ala Glu Val Lys Met Asp 
1 5 

<210> 72 
<211> 5 
<212> PRT 

<21i> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 72. 

Asp Ala Glu Phe Arg 

1 5 



Sequence : Pept ide 

Ala Glu Phe Lys Asp Asn Pro 
10 15 

Sequence: synthetic 



<210> 73 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
<400> 73 



Ser Glu Val Asn Leu 
1 5 



'i 
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